Data Mining

The process of analysing large datasets to extract useful information and discover patterns.

Related Publications:

  • [WebSci2009] Victor, Patricia, Cornelis, Chris, De Cock, Martine, Teredesai, Ankur - Trust- and Distrust-Based Recommendations for Controversial Reviews - http://www.websci09.org
  • [WebSci2009] Au Yeung, Ching Man, Noll, Michael, Gibbins, Nicholas, Meinel, Christoph, Shadbolt, Nigel - On Measuring Expertise in Collaborative Tagging Systems. - http://www.websci09.org/
  • [WebSci2010] Gaffney, Devin - #iranElection: quantifying online activism. - http://www.websci10.org/
  • [WebSci2010] Mustafaraj, Eni, Metaxas, Panagiotis - From Obscurity to Prominence in Minutes: Political Speech and Real-Time Search - http://www.websci10.org/
  • [WebSci2010] Au Yeung, Ching Man - Analysis of Strategies for Item Discovery in Social Sharing on the Web. - http://www.websci10.org/
  • [WebSci2010] Finin, Tim, Syed, Zareen, Mulwad, Varish, Joshi, Anupam - Exploiting a Web of Semantic Data for Interpreting Tables. - http://www.websci10.org/
  • [WebSci2010] Möller, Knud, Hausenblas, Michael, Cyganiak, Richard, Grimnes, Gunnar, Handschuh, Siegfried - Learning from Linked Open Data Usage: Patterns & Metrics. - http://www.websci10.org/
  • [WebSci2011] Janaína Gomide, Adriano Veloso, Wagner Meira Jr, Fabrício Benevenuto, Virgílio Almeida, Fernanda Ferraz, Mauro Teixeira - Dengue surveillance based on a computational model of spatio-temporal locality of Twitter. - http://www.websci11.org/
  • [WebSci2011] Fabian Flöck, Denny Vrandecic, Elena Simperl - Towards a diversity-minded Wikipedia. - http://www.websci11.org/
  • [WebSci2011] Nasir Naveed, Thomas Gottron, Jérôme Kunegis, Arifah Che Alhadi - Bad News Travel Fast: A Content-based Analysis of Interestingness on Twitter. - http://www.websci11.org/
  • [WebSci2011] Pascal Juergens, Andreas Jungherr, Harald Schoen - Small Worlds with a Difference: New Gatekeepers and the Filtering of Political Information on Twitter. - http://www.websci11.org/
  • [WebSci2012] Trevor Collins, Paul Mulholland, Annika Wolff - Web supported emplotment: Using object and event descriptions to facilitate storytelling online and in galleries - https://dl.acm.org/citation.cfm?id=2380728
  • [WebSci2012] Mingyan Gao, Vivek K. Singh, Ramesh Jain - EventShop: From Heterogeneous Web Streams to Personalized Situation Detection and Control - https://dl.acm.org/citation.cfm?id=2380733
  • [WebSci2012] Pushmeet Kohli, Yoram Bachrach, David Stillwell, Michael Kearns, Ralf Herbrich, Thore Graepel - Colonel Blotto On Facebook: The Effect of Social Relations On Strategic Interaction - https://dl.acm.org/citation.cfm?id=2380738
  • [WebSci2012] Jaimie Y. Park, Chin-Wan Chung - When Daily Deal Services Meet Twitter: Understanding Twitter as a Daily Deal Marketing Platform - https://dl.acm.org/citation.cfm?id=2380748
  • [WebSci2012] Ernesto Diaz-Aviles, Avaré Stewart - Tracking Twitter for Epidemic Intelligence - http://www.websci12.org/
  • [WebSci2012] Chun-Yuen Teng, Liuling Gong, Avishay Livne, Celso Brunetti, Lada Adamic - Coevolution of Network Structure and Content - http://www.websci12.org/
  • [WebSci2012] Ingmar Weber, Venkata Rama Kiran Garimella, Erik Borra - Mining Web Query Logs to Analyze Political Issues - http://www.websci12.org/
  • [WebSci2012] Jian Wu, Pradeep Teregowda, Juan Pablo Fernandez Ramírez, Prasenjit Mitra, Shuyi Zheng, C. Lee Giles - The Evolution of a Crawling Strategy for an Academic Document Search Engine: Whitelists and Blacklists - http://www.websci12.org/
  • [WebSci2013] Paul Gaskell, Frank McGroarty, Thanassis Tiropanis - An Investigation into Correlations between Financial Sentiment and Prices in Financial Markets - http://www.websci13.org/
  • [WebSci2013] Mathieu d’Aquin, Alessandro Adamou, Stefan Dietze - Assessing the Educational Linked Data Landscape - http://www.websci13.org/
  • [WebSci2013] Wouter van Atteveldt, Tamir Sheafer, Shaul Shenhav - Automatically Extracting Frames from Media Content using Syntacting Analysis - http://www.websci13.org/
  • [WebSci2013] Souneil Park, Minsam Ko, Jaeung Lee, Aram Choi, Junehwa Song - Challenges and Opportunities of Local Journalism: A Case Study of the 2012 Korean General Election - http://www.websci13.org/
  • [WebSci2013] Richard Rogers - Debanalizing Twitter: The Transformation of an Object of Study - http://www.websci13.org/
  • [WebSci2013] Catherine C. Marshall, Frank M. Shipman - Experiences Surveying the Crowd: Reflections on Methods, Participation, and Reliability - http://www.websci13.org/
  • [WebSci2013] Anca Dumitrache, Paul Groth, Peter van den Besselaar - Identifying Research Talent Using Web-Centric Databases - http://www.websci13.org/
  • [WebSci2013] Daniel Preoţiuc-Pietro, Trevor Cohn - Mining User Behaviours: A Study of Check-in Patterns in Location Based Social Networks - http://www.websci13.org/
  • [WebSci2013] Jérôme Kunegis, Marcel Blattner, Christine Moser - Preferential Attachment in Online Networks: Measurement and Explanations - http://www.websci13.org/
  • [WebSci2013] Derek Greene, Padraig Cunningham - Producing a Unified Graph Representation from Multiple Social - http://www.websci13.org/
  • [WebSci2013] Shu Huang, Wei Peng, Jingxuan Li, Dongwon Lee - Sentiment and Topic Analysis on Social Media: A Multi-Task Multi-Label Classification Approach - http://www.websci13.org/
  • [WebSci2013] Yuqing Lu, Lei Zhang, Yudong Xiao, Yangguang Li - Simultaneously Detecting Fake Reviews and Review Spammers using Factor Graph Model - http://www.websci13.org/
  • [WebSci2013] Munmun De Choudhury, Scott Counts, Eric Horvitz - Social Media as a Measurement Tool of Depression in Populations - http://www.websci13.org/
  • [WebSci2013] Hugo C. Huurdeman, Anat Ben-David, Thaer Sammar - Sprint Methods for Web Archive Research - http://www.websci13.org/
  • [WebSci2013] Bernhard Rieder - Studying Facebook via Data Extraction: The Netvizz Application - http://www.websci13.org/
  • [WebSci2013] Johannes Schantl, Claudia Wagner, Rene Kaiser, Markus Strohmaier - The Utility of Social and Topical Factors in Anticipating Repliers in Twitter Conversations - http://www.websci13.org/
  • [WebSci2013] Tim Davies, Mark Frank - ‘There’s no such thing as raw data’. Exploring the socio-technical life of a government dataset - http://www.websci13.org/
  • [WebSci2013] Hans Akkermans, Rena Bakhshi - Toward a Next Generation of Network Models for the Web - http://www.websci13.org/
  • [WebSci2013] Antoine Mazieres, Samuel Huron - Toward Google Borders - http://www.websci13.org/
  • [WebSci2013] Sebastien Heymann, Bénédicte Le Grand - Towards A Redefinition of Time in Information Networks? - http://www.websci13.org/
  • [WebSci2013] Jisun An, Daniele Quercia, Meeyoung Cha, Krishna Gummadi, Jon Crowcroft - Traditional media seen from social media - http://www.websci13.org/
  • [WebSci2013] Derek O’Callaghan, Derek Greene, Maura Conway, Joe Carthy, Padraig Cunningham - Uncovering the Wider Structure of Extreme Right Communities Spanning Popular Online Networks - http://www.websci13.org/
  • [WebSci2014] Chao Yang, Padmini Srinivasan - Translating surveys to surveillance on social media: methodological challenges & solutions - http://www.websci14.org/
  • [WebSci2014] Jiejun Xu, Ryan Compton, Tsai-Ching Lu, David Allen - Rolling through tumblr: characterizing behavioral patterns of the microblogging platform - http://www.websci14.org/
  • [WebSci2014] Asmelash Teka Hadgu, Robert Jäschke - Identifying and analyzing researchers on twitter - http://www.websci14.org/
  • [WebSci2014] Abdullah Almaatouq, Ahmad Alabdulkareem, Mariam Nouh, Erez Shmueli, Mansour Alsaleh, Vivek K. Singh, Abdulrahman Alarifi, Anas Alfaris, Alex (Sandy) Pentland - Twitter: who gets caught? observed trends in social micro-blogging spam - http://www.websci14.org/
  • [WebSci2014] Luam Catao Totti, Felipe Almeida Costa, Sandra Avila, Eduardo Valle, Wagner Meira Jr., Virgílio Almeida - The impact of visual attributes on online image diffusion - http://www.websci14.org/
  • [WebSci2014] Anh Le, Konstantinos Pelechrinis, Prashant Krishnamurthy - Country-level spatial dynamics of user activity: a case study in location-based social networks - http://www.websci14.org/
  • [WebSci2014] Onur Varol, Emilio Ferrara, Christine L. Ogan, Filippo Menczer, Alessandro Flammini - Evolution of online user behavior during a social upheaval - http://www.websci14.org/
  • [WebSci2014] Oliver Lehmberg, Robert Meusel, Christian Bizer - Graph structure in the web: aggregated by pay-level domain - http://www.websci14.org/
  • [WebSci2014] Sergei Koltcov, Olessia Koltsova, Sergey Nikolenko - Latent dirichlet allocation: stability and applications to studies of user-generated content - http://www.websci14.org/
  • [WebSci2014] Ramine Tinati, Olivier Phillipe, Catherine Pope, Les Carr, Susan Halford - Challenging social media analytics: web science perspectives - http://www.websci14.org/
  • [WebSci2014] Yaser Keneshloo, Jose Cadena, Gizem Korkmaz, Naren Ramakrishnan - Detecting and forecasting domestic political crises: a graph-based approach - http://www.websci14.org/
  • [WebSci2014] Matthew Rowe, Harith Alani - Mining and comparing engagement dynamics across multiple social media platforms - http://www.websci14.org/
  • [WebSci2015] Derek Greene, James P. Cross - Unveiling the Political Agenda of the European Parliament Plenary: A Topical Analysis - http://www.websci15.org/
  • [WebSci2015] Paul Laufer, Claudia Wagner, Fabian Flöck, Markus Strohmaier - Mining cross-cultural relations from Wikipedia: A study of 31 European food cultures - http://www.websci15.org/
  • [WebSci2015] Ujwal Gadiraju, Stefan Dietze, Ernesto Diaz-Aviles - Ranking Buildings and Mining the Web for Popular Architectural Patterns - http://www.websci15.org/
  • [WebSci2015] Anna Zawilska, Steven Albury - An Ethnomethodologically-Informed Approach to Interface Design to Support Collective Web Practice Around Video - http://www.websci15.org/
  • [WebSci2015] Rolf Fredheim, Alfred Moore, John Naughton - Anonymity and Online Commenting: The Broken Windows Effect and the End of Drive-by Commenting - http://www.websci15.org/
  • [WebSci2015] Di Lu, Rosta Farzan - Time to Introduce Myself!: Impact of Self-disclosure Timing of Newcomers in Online Discussion Forums - http://www.websci15.org/
  • [WebSci2015] Debanjan Mahata, John R. Talburt, Vivek Kumar Singh - From Chirps to Whistles: Discovering Event-specific Informative Content from Twitter - http://www.websci15.org/
  • [WebSci2015] Abigail Z. Jacobs, Samuel F. Way, Johan Ugander, Aaron Clauset - Assembling thefacebook: Using Heterogeneity to Understand Online Social Network Assembly - http://www.websci15.org/
  • [WebSci2015] Josh Introne, Sean Goggins - Taming a Menagerie of Heavy Tails with Skew Path Analysis - http://www.websci15.org/
  • [WebSci2015] Han-Teng Liao, King-Wa Fu, Scott A. Hale - How much is said in a microblog?: A multilingual inquiry based on Weibo and Twitter - http://www.websci15.org/
  • [WebSci2015] Ramine Tinati, Markus Luczak-Roesch, Elena Simperl, Nigel Shadbolt, Wendy Hall - '/Command' and Conquer: Analysing Discussion in a Citizen Science Game - http://www.websci15.org/
  • [WebSci2015] Rebecca Nash - Considering a Wider Web?: Employing Multimodal Critical Discourse Analysis in Exploration of Multiple Online Spaces - http://www.websci15.org/
  • [WebSci2015] Mengfan Tang, Pranav Agrawal, Ramesh Jain - Habits vs Environment: What Really Causes Asthma? - http://www.websci15.org/
  • [WebSci2016] Guanliang Chen, Dan Davis, Jun Lin, Claudia Hauff, Geert-Jan Houben - Beyond the MOOC platform: gaining insights about learners from the social web - https://dan7davis.github.io/papers/websci2016_beyond__1_.pdf
  • [WebSci2016] Mariana Arantes, Flavio Figueiredo, Jussara M. Almeida - Understanding video-ad consumption on YouTube: a measurement study on user behavior, popularity, and content properties - https://dl.acm.org/citation.cfm?id=2908159&dl=ACM&coll=DL
  • [WebSci2016] Kyungsik Han, Sanghack Lee, Jin Yea Jang, Yong Jung, Dongwon Lee - Teens are from mars, adults are from venus: analyzing and predicting age groups with behavioral characteristics in instagram - https://dl.acm.org/citation.cfm?id=2908160
  • [WebSci2016] Amaç Herdağdelen, Bogdan State, Lada Adamic, Winter Mason - The social ties of immigrant communities in the United States - https://dl.acm.org/citation.cfm?id=2908163
  • [WebSci2016] Walid Magdy, Kareem Darwish, Norah Abokhodair, Afshin Rahimi, Timothy Baldwin - #ISISisNotIslam or #DeportAllMuslims?: predicting unspoken views - https://dl.acm.org/citation.cfm?id=2908150
  • [WebSci2016] Natalia Boldyrev, Marc Spaniol, Gerhard Weikum - ACROSS: A framework for multi-cultural interlinking of web taxonomies - https://dl.acm.org/citation.cfm?id=2908164
  • [WebSci2016] Katrin Weller, Katharina E. Kinder-Kurlanda - A manifesto for data sharing in social media research - https://dl.acm.org/citation.cfm?id=2908172
  • [WebSci2016] Pablo Loyola, Francisco Szederkenyi, Yutaka Matsuo - Using the web to support political analysis: identifying legislative bill ideology in the chilean parliament - https://dl.acm.org/citation.cfm?id=2908166
  • [WebSci2016] Daniel Alexandrov, Alexey Gorgadze, Ilya Musabirov - Virtual caucasus on VK social networking site - https://dl.acm.org/citation.cfm?id=2908205
  • [WebSci2016] Paolo Boldi, Corrado Monti - LlamaFur: learning latent category matrix to find unexpected relations in Wikipedia - https://dl.acm.org/citation.cfm?id=2908153
  • [WebSci2016] Mattia Samory, Enoch Peserico - Content attribution ignoring content - https://dl.acm.org/citation.cfm?id=2908156
  • [WebSci2016] Yiwei Zhou, Alexandra I. Cristea - Towards detection of influential sentences affecting reputation in wikipedia - https://dl.acm.org/citation.cfm?id=2908177
  • [WebSci2016] Hang Zhang, Vinay Setty - Finding diverse needles in a haystack of comments: social media exploration for news - https://dl.acm.org/citation.cfm?id=2908168
  • [WebSci2016] Gerhard Gossen, Elena Demidova, Thomas Risse - Analyzing web archives through topic and event focused sub-collections - https://dl.acm.org/citation.cfm?id=2908175
  • [WebSci2016] Deniz Iren, Cynthia C. S. Liem, Jie Yang, Alessandro Bozzon - Using social media to reveal social and collective perspectives on music - https://dl.acm.org/citation.cfm?id=2908178
  • [WebSci2017] Xingsheng He, Di Lu, Drew Margolin, Mengdi Wang, Salma El Idrissi, Yu-Ru Lin - The Signals and Noise: Actionable Information in Improvised Social Media Channels During a Disaster - http://www.websci17.org/
  • [WebSci2017] Olga Zagovora, Fabian Flöck, Claudia Wagner - "(Weitergeleitet von Journalistin)": The Gendered Presentation of Professions on Wikipedia - http://www.websci17.org/
  • [WebSci2017] Frederick Ayala-Gómez, Bálint Daróczy, Michael Mathioudakis, András Benczúr, Aristides Gionis - Where Could We Go?: Recommendations for Groups in Location-Based Social Networks - http://www.websci17.org/
  • [WebSci2017] Qing Ke - Sharing Means Renting?: An Entire-marketplace Analysis of Airbnb - http://www.websci17.org/
  • [WebSci2017] Lingzi Hong, Cheng Fu, Paul Torrens, Vanessa Frias-Martinez - Understanding Citizens' and Local Governments' Digital Communications During Natural Disasters: The Case of Snowstorms - http://www.websci17.org/
  • [WebSci2017] Sucheta Soundarajan, Tina Eliassi-Rad, Brian Gallagher, Ali Pinar - ε-WGX: Adaptive Edge Probing for Enhancing Incomplete Networks - http://www.websci17.org/
  • [WebSci2017] Omar Alonso, Vasileios Kandylas, Serge-Eric Tremblay, Jake M. Hofman, Siddhartha Sen - What's Happening and What Happened: Searching the Social Web - http://www.websci17.org/
  • [WebSci2017] Jiejun Xu, Daniel Xie, Tsai-Ching Lu, John Cafeo - EDSV: Emerging Defect Surveillance for Vehicles - http://www.websci17.org/
  • [WebSci2017] Sharath Chandra Guntuku, Weisi Lin, Jordan Carpenter, Wee Keong Ng, Lyle H. Ungar, Daniel Preoţiuc-Pietro - Studying Personality through the Content of Posted and Liked Images on Twitter - http://www.websci17.org/
  • [WebSci2017] Jennifer Golbeck, Zahra Ashktorab, Rashad O. Banjo, Alexandra Berlinger, Siddharth Bhagwan, Cody Buntain, Paul Cheakalos, Alicia A. Geller, Quint Gergory, Rajesh Kumar Gnanasekaran, Raja Rajan Gunasekaran, Kelly M. Hoffman, Jenny Hottle, Vichita Jienjitlert, Shivika Khare, Ryan Lau, Marianna J. Martindale, Shalmali Naik, Heather L. Nixon, Piyush Ramachandran, Kristine M. Rogers, Lisa Rogers, Meghna Sardana Sarin, Gaurav Shahane, Jayanee Thanki, Priyanka Vengataraman, Zijian Wan, Derek Michael Wu - A Large Labeled Corpus for Online Harassment Research - http://www.websci17.org/
  • [WebSci2017] Ramine Tinati, Aastha Madaan, Wendy Hall - InstaCan: Examining Deleted Content on Instagram - http://www.websci17.org/
  • [WebSci2017] Laura Cruz-Albrecht, Jiejun Xu, Kang-Yu Ni, Tsai-Ching Lu - Characterizing Regional and Behavioural Device Variations Across the Twitter Timeline: A Longitudinal Study - http://www.websci17.org/
  • [WebSci2017] Helge Holzmann, Wolfgang Nejdl, Avishek Anand - Exploring Web Archives Through Temporal Anchor Texts - http://www.websci17.org/
  • [WebSci2017] Yasmin AlNoamany, Michele C. Weigle, Michael L. Nelson - Generating Stories From Archived Collections - http://www.websci17.org/
  • [WebSci2017] Nora Alrajebah, Leslie Carr, Markus Luczak-Roesch, Thanassis Tiropanis - Deconstructing Diffusion on Tumblr: Structural and Temporal Aspects - http://www.websci17.org/
  • [WebSci2017] Helena Webb, Marina Jirotka, Bernd Carsten Stahl, William Housley, Adam Edwards, Matthew Williams, Rob Procter, Omer Rana, Pete Burnap - The Ethical Challenges of Publishing Twitter Data for Research Dissemination - http://www.websci17.org/
  • [WebSci2017] Yiming Liao, Thanh Tran, Dongwon Lee, Kyumin Lee - Understanding Temporal Backing Patterns in Online Crowdfunding Communities - http://www.websci17.org/