The National Institutes of Health, under the National Library of Medicine’s program on data science research, awarded a $1.55 million grant to an interdisciplinary team lead by Luis Rocha, a professor of informatics, member of CNETS and the director of the NSF-NRT complex networks & systems program at the School of Informatics, Computing, and Engineering. The four-year project, a collaboration between SICE and the Indiana University School of Nursing, will employ innovative data- and network-science methods to produce myAURA, an easy-to-use web service for epilepsy patients. myAURA will be based on a large-scale epilepsy knowledge graph built by integrating data from social media, electronic health records, patient discussion boards, scientific literature databases, advocacy websites, and mobile app data. The knowledge graph will, in turn, be used to fuel recommendation and visualization algorithms based on the automatic inference of relevant associations. The inference will follow algorithms developed by Rocha’s team to remove redundancy and extract factual information from large knowledge graphs as well as parsimonious network visualizations developed by Katy Börner, Distinguished Professor of Engineering & Information Science at SICE. Continue reading Team led by Rocha awarded NIH grant to improve chronic-disease management with Data and Network Science
Traditional fact checking by expert journalists cannot keep up with the enormous volume of information that is now generated online. Computational fact checking may significantly enhance our ability to evaluate the veracity of dubious information.
We have shown that the complexities of fact checking can be captured by finding the shortest path between concept nodes under properly defined semantic proximity metrics on knowledge graphs. Framed as a network problem this approach is feasible with efficient computational techniques. Such approaches can be designed to not only be scalable and effective at assessing veracity of dubious claims, but also to boost a human fact checker’s productivity by surfacing relevant facts and patterns to aid their analysis. We evaluated this approach by examining tens of thousands of claims related to history, entertainment, geography, and biographical information using a public knowledge graph extracted from Wikipedia. Statements independently known to be true consistently receive higher support via our method than do false ones.
We also developed a network-flow based approach. We showed that computational fact checking amounts to finding a “knowledge stream” that emanates from a subject node and flows toward an object node through paths connecting them. Evaluation on a range of real-world and hand-crafted datasets of facts related to entertainment, business, sports, geography and more reveals that this network-flow model can be very effective in discerning true statements from false ones, outperforming existing algorithms on many test cases. Moreover, the model is expressive in its ability to automatically discover several useful path patterns and surface relevant facts that may help a human fact checker corroborate or refute a claim.
These findings represent a significant step toward scalable computational fact-checking methods that may one day mitigate the spread of harmful misinformation.