Check out the Great Minds Think Alike game for iPhone, iPad and iPod Touch. It is a word association game that lets users build semantic concept networks and explore similarity relations between people and media content. Starting from a single word, the player builds a chain of words by selecting semantically related terms. Get the game from the App Store!
NaN had a strong presence at Hypertext 2009 in Torino:
- Mark’s paper What’s in a session: tracking individual behavior on the web was nominated for the Best Paper Award.
- Heather presented the demo Incentives for social annotation about the prototype Firefox extension for GiveALink.org, and the tagging game. (Heather is also demoing at SIGIR’09 in Boston.)
- I presented the demo Sixearch.org 2.0 peer application for collaborative web search about the latest release of Sixearch.
- Ben presented his poster on A scalable, collaborative similarity measure for social annotation systems.
I gave four invited talks in Spain, Italy, and Switzerland this summer:
- June 18: Social similarity at Yahoo! Labs Barcelona (host: Ricardo Baeza-Yates) — this is where I got the idea of a foosball table in the lab…
- June 19: Dynamics of Online Popularity at the University of Barcelona’s Department of Fundamental Physics (host: Marian Boguna)
- June 25: Social similarity at DEI, Politecnico di Milano (host: Stefano Ceri)
- June 26: Modeling text generation at the Faculty of Informatics, University of Lugano (host: Fabio Crestani)
Thanks to my wonderful hosts and their groups for engaging discussions and delightful hospitality!
Research and Creativity Activity profiles research by CNetS faculty Filippo Menczer and Alessandro Vespignani and their groups in a special issue on networks. More…
Link analysis algorithms leverage hyperlinks created by authors as semantic endorsements between pages, while social bookmarks provide a way to leverage annotations by information consumers as a source of information about pages. This project explores a novel approach that is a synergy of the two: soliciting annotations from users about the content of pages, in a way that implicitly forms networks of relationships between and among resources and tags. These socially generated relationships are then aggregated to build bottom-up, global semantic similarity networks. Algorithms are developed to construct, analyze, and mine these networks in support of search and recommendation applications, exploratory navigation interfaces, resource management utilities, tag spam detection, and incentive games to accelerate the achievement of critical mass.
To extrapolate both annotations about content (tags) and semantic relationships (similarity) from single users to the “wisdom of the crowd,” the project investigates an information-theoretic model that extracts semantic assessments from information structures that many users are already maintaining, namely the bookmarks and tags they manage on their browsers or online. This entails the design and evaluation of several network-based measures and algorithms, such as similarity, novelty, centrality, and focus. Among the aims of this model are the exploration of the duality between resources (URLs) and concepts (tags or categories) and the integration of social annotation and collaborative filtering. One way to provide users with immediate value is to integrate client-based taxonomies and server-based folksonomies for social bookmark management. Both traditional users of browser bookmarks and social users of online bookmarks can take advantage of the same semantic maps while retaining the convenience of intuitive browser interfaces and centralized storage.
Strategic collaborations to share data, accelerate evaluation, and maximize impact are under way with key groups in Europe through the TAGora Project and its partners at Rome Sapienza, Sony Paris, the ISI Foundation in Torino, and the BibSonomy group at Kassel University. GiveALink.org (supported by a wonderful computing and storage infrastructure) is an open social bookmarking platform developed to experiment with and demonstrate the ideas of this project. The algorithms and data generated by the project are made available to the Web community to facilitate analysis, the development of improved network algorithms, and integration with other Internet applications. Early results of this project have been presented at various conferences and workshops including LinkKDD2005, AAAI2006, and HT2008. More recent publications are listed below. To learn more, donate your bookmarks, play with our system, and download our data and applications please visit GiveALink.org.
Collaborators & Alumni:
We should also acknowledge Todd Holloway for his contributions to the early search engine; Luis Rocha and Ana Maguitman for suggesting the idea of ranking and searching by novelty; Mark Meiss, who thought of the catchy name for GiveALink; and Rob Henderson, quite possibly the greatest sysadmin around.
|This project is supported by the National Science Foundation under award IIS-0811994: Social Integration of Semantic Annotation Networks for Web Applications. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.|
I will be using Tuesday (3/24) as a practice talk for AIRWEB 2009. Hope everyone had a nice spring break!
Title: Social Spam Detection
The popularity of social bookmarking sites has made them prime targets for spammers. Many of these systems require an administrator’s time and energy to manually filter or remove spam. Here we discuss the motivations ofsocial spam, and present a study of automatic detection of spammers in a social tagging system. We identify and analyze six distinct features that address various properties of social spam, finding that each of these features provides for a helpful signal to discriminate spammers from legitimate users. These features are then used in various machine learning
algorithms for classification, achieving over 98% accuracy in detecting social spammers with 2% false positives. These promising results provide a new baseline for future efforts on social spam. We make our dataset publicly
available to the research community.
No, it’s not an Italian spin-off of the popular TV show. CSI Piemonte is organizing a meeting on Understanding Complexity: a Journey through Science to be held November 22-23 at the Lingotto Convention Center here in Torino. We will have demos and posters on 6S, GiveALink, and the egalitarian effect of search engines. I look forward in particular to seeing my good old friend Dario and my mentor, Domenico.