Fil will present the paper Folks in folksonomies: Social link prediction from shared metadata (authored with Rossano Schifanella, Alain Barrat, Ciro Cattuto, and Ben Markines) at WSDM 2010 in New York on February 5. The paper discusses homophily, or more specifically the relationship between social connections and social tagging in folksonomies. We show that social similarity measures based on annotations can be effective predictors of friendship relationships. For the occasion, we are making our Last.fm dataset publicly available.
Fil Menczer is one of the organizers of Hypertext 2009, the 20th ACM Conference on Hypertext an Hypermedia. The conference will be held June 29-July 1 at the Villa Gualino Convention Centre, on the hills overlooking Torino, Italy. Hypertext is the main venue for high quality peer-reviewed research on “linking.” The Web, the Semantic Web, the Web 2.0, and Social Networks are all manifestations of the success of the link. With a 70% increase in submissions, Hypertext 2009 will have a strong and diverse technical program covering all research concerning links: their semantics, their presentation, the applications, as well as the knowledge that can be derived from their analysis and their effects on society. The conference will also feature demos, posters, a student research competition, four workshops, and keynotes by Lada Adamic and Ricardo Baeza-Yates.
Research and Creativity Activity profiles research by CNetS faculty Filippo Menczer and Alessandro Vespignani and their groups in a special issue on networks. More…
UPDATE: As of 2015 the GiveAlink project has been archived and the GiveALink.org website is no longer operational.
Link analysis algorithms leverage hyperlinks created by authors as semantic endorsements between pages, while social bookmarks provide a way to leverage annotations by information consumers as a source of information about pages. This project explores a novel approach that is a synergy of the two: soliciting annotations from users about the content of pages, in a way that implicitly forms networks of relationships between and among resources and tags. These socially generated relationships are then aggregated to build bottom-up, global semantic similarity networks. Algorithms are developed to construct, analyze, and mine these networks in support of search and recommendation applications, exploratory navigation interfaces, resource management utilities, tag spam detection, and incentive games to accelerate the achievement of critical mass.
To extrapolate both annotations about content (tags) and semantic relationships (similarity) from single users to the “wisdom of the crowd,” the project investigates an information-theoretic model that extracts semantic assessments from information structures that many users are already maintaining, namely the bookmarks and tags they manage on their browsers or online. This entails the design and evaluation of several network-based measures and algorithms, such as similarity, novelty, centrality, and focus. Among the aims of this model are the exploration of the duality between resources (URLs) and concepts (tags or categories) and the integration of social annotation and collaborative filtering. One way to provide users with immediate value is to integrate client-based taxonomies and server-based folksonomies for social bookmark management. Both traditional users of browser bookmarks and social users of online bookmarks can take advantage of the same semantic maps while retaining the convenience of intuitive browser interfaces and centralized storage.
Strategic collaborations to share data, accelerate evaluation, and maximize impact are under way with key groups in Europe through the TAGora Project and its partners at Rome Sapienza, Sony Paris, the ISI Foundation in Torino, and the BibSonomy group at Kassel University. GiveALink.org (supported by a wonderful computing and storage infrastructure) is an open social bookmarking platform developed to experiment with and demonstrate the ideas of this project. The algorithms and data generated by the project are made available to the Web community to facilitate analysis, the development of improved network algorithms, and integration with other Internet applications. Early results of this project have been presented at various conferences and workshops including LinkKDD2005, AAAI2006, and HT2008. More recent publications are listed below. To learn more, donate your bookmarks, play with our system, and download our data and applications please visit GiveALink.org.
Collaborators & Alumni:
We should also acknowledge Todd Holloway for his contributions to the early search engine; Luis Rocha and Ana Maguitman for suggesting the idea of ranking and searching by novelty; Mark Meiss, who thought of the catchy name for GiveALink; and Rob Henderson, quite possibly the greatest sysadmin around.
|This project is supported by the National Science Foundation under award IIS-0811994: Social Integration of Semantic Annotation Networks for Web Applications. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.|
Fil Menczer recently received a $500,000 grant from the National Science Foundation to investigate the Social Integration of Semantic Annotation Networks for Web Applications. The project brings together complex networks and Web mining techniques to develop a new generation of search engines and collaborative Web applications such as GiveALink.org. The researchers will leverage existing annotations from users (such as the bookmarks they already maintain on their browsers) and elicit new ones through useful tools and games. The research will lead to a framework for building and maintaining socio-semantic networks of relationships between, and among, users, tags, and Web sites. In the end, these networks will improve social Web applications such as search, recommendation, spam detection, and exploratory navigation interfaces. More…