News and random (but not uniformly distributed) thoughts about Fil, NaN, CNetS, divA, Informatics & Computing…
Congratulations to Lilian Weng, who successfully defended her Informatics PhD dissertation titled Information diffusion on online social networks. The thesis provides insights into information diffusion on online social networks from three aspects: people who share information, features of transmissible content, and the mutual effects between network structure and diffusion process. The first part delves into the limited human attention. The second part of Dr. Weng’s dissertation investigates properties of transmissible content, particularly into the topic space. Finally, the thesis presents studies of how network structure, particularly community structure, influences the propagation of Internet memes and how the information flow in turn affects social link formation. Dr. Weng’s work can contribute to a better and more comprehensive understanding of information diffusion among online social-technical systems and yield applications to viral marketing, advertisement, and social media analytics. Congratulations from her colleagues and committee members: Alessandro Flammini, YY Ahn, Steve Myers, and Fil Menczer!
On August 11, 2013, the New York Times published an article by Ian Urbina with the headline: I Flirt and Tweet. Follow Me at #Socialbot. The article reports on how socialbots (software simulating people on social media) are being designed to sway elections, to influence the stock market, even to flirt with people and one another. Fil Menczer is quoted: “Bots are getting smarter and easier to create, and people are more susceptible to being fooled by them because we’re more inundated with information.” The article also mentions the Truthy project and some of our 2010 findings on political astroturf.
Inspired by this, the writers of The Good Wife consulted with us on an episode in which the main character finds that a social news site is using a socialbot to bring traffic to the site, defaming her client. The episode aired on November 24, 2013, on CBS (Season 5 Episode 9, “Whack-a-Mole”). Good show!
ACM, the professional association of computer scientists and computing professionals, announced today that I was named a Distinguished Scientist. Here is the list of other ACM members who got this award. This is a great honor and I am grateful. But my thanks go especially to my many amazing collaborators (colleagues, postdocs, visiting scholars, and especially students) without whom my contributions and impact would not exist — this award is also yours!
And while I am bragging, let me also mention that I was recently named a Senior Research Fellow of The Kinsey Institute for Research in Sex, Gender, and Reproduction. This is another great honor and I am excited about our team’s collaboration with the Kinsey Institute on the Kinsey Reporter project. The Kinsey Institute has an awesome tradition of trailblazing research and I hope that we can make a small contribution to it. Thanks to both the Kinsey Reporter team and our Kinsey collaborators!
A story in Nature discusses a recent paper (preprint) from CNetS members Jasleen Kaur, Filippo Radicchi and Fil Menczer on the universality of scholarly impact metrics. In the paper, we present a method to quantify the disciplinary bias of any scholarly impact metric. We use the method to evaluate a number of established scholarly impact metrics. We also introduce a simple universal metric that allows to compare the impact of scholars across scientific disciplines. Mohsen JafariAsbagh integrated this metric into Scholarometer, a crowdsourcing system developed by our group to collect and share scholarly impact data. The Nature story highlight how one can use normalized impact metrics to rank all scholars, as illustrated in the widget shown here.
We are excited to announce that the ACM Web Science 2014 Conference will be hosted by our center on the beautiful IUB campus June 23–26, 2014. Web Science studies the vast information network of people, communities, organizations, applications, and policies that shape and are shaped by the Web, the largest artifact constructed by humans in history. Computing, physical, and social sciences come together, complementing each other in understanding how the Web affects our interactions and behaviors. Previous editions of the conference were held in Athens, Raleigh, Koblenz, Evanston, and Paris. The conference is organized on behalf of the Web Science Trust by general co-chairs Fil Menczer, Jim Hendler, and Bill Dutton. Follow us on Twitter and see you in Bloomington!
Findings by CNetS researchers on social media indicators of election results received significant coverage in the national press. The paper More Tweets, More Votes: Social Media as a Quantitative Indicator of Political Behavior by Joseph Digrazia, Karissa McKelvey, Johan Bollen, and Fabio Rojas was presented at the 2013 Meeting of the American Sociological Association in NYC. It was covered by NPR, The Wall Street Journal, MSNBC, C-SPAN, The Washington Post, The Atlantic, and many other media.
Congratulations to Przemyslaw Grabowicz, Luca Aiello, and Fil Menczer for winning the WICI Data Challenge. A prize of $10,000 CAD accompanies this award from the Waterloo Institute for Complexity and Innovation at the University of Waterloo. The Challenge called for tools and methods that improve the exploration, analysis, and visualization of complex-systems data. The winning entry, titled Fast visualization of relevant portions of large dynamic networks, is an algorithm that selects subsets of nodes and edges that best represent an evolving graph and visualizes it either by creating a movie, or by streaming it to an interactive network visualization tool. The algorithm is deployed in the movie generation tool of the Truthy system, which allows users to create, in near-real time, YouTube videos that illustrate the spread and co-occurrence of memes on Twitter. Przemek and Luca worked on this project while visiting CNetS in 2011 and collaborating with the Truthy team. Bravo!
On June 27, 2013, in Turin, within the celebrations of the Lagrange Prize and ISI Foundation’s 30th anniversary, Fil Menczer and nine other scientists were named ISI Fellows. The recognition is a tribute to researchers whose scientific contribution is of primary importance for the Institute. The official investiture took place on June 28th, 2013, during the conference The Being of Science, which highlighted the ways in which the Fellows’ research fields intertwine with the ISI scientific activities.
UPDATE: With legal review completed, we re-launched Kinsey Reporter V.2!
CNetS, in collaboration with The Kinsey Institute, has released Kinsey Reporter, a global mobile survey platform for collecting and sharing anonymous data about sexual and other intimate behaviors. The pilot project allows citizen observers around the world to use free applications now available for Apple and Android mobile platforms to not only report on sexual behavior and experiences, but also to share, explore and visualize the accumulated data.
This new platform will allow us to explore issues that have been challenging to study until now, such as the prevalence of unreported sexual violence in different parts of the world, or the correlation between various sexual practices like condom use, for example, and the cultural, political, religious or health contexts in particular geographical areas.
The Kinsey Institute’s longstanding seminal studies of sexual behaviors created a perfect synergy with research going on at CNetS related to mining big data crowd-sourced from mobile social media. The sensitive domain — sexual relations — added an intriguing challenge in finding a way to share useful data with the community while protecting the privacy and anonymity of the reporting volunteers.
To foster the study of the structure and dynamics of Web traffic networks, we are making available to the research community a large Click Dataset of
13 53.5 billion HTTP requests collected at Indiana University. Between 2006 and 2010, our system generated data at a rate of about 60 million requests per day, or about 30 GB/day of raw data. We hope that this data will help develop a better understanding of user behavior online and create more realistic models of Web traffic. The potential applications of this data include improved designs for networks, sites, and server software; more accurate forecasting of traffic trends; classification of sites based on the patterns of activity they inspire; and improved ranking algorithms for search results.