AFFILIATE RESEARCH
DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter
Kai-Cheng Yang, Pranav Goel, Alexi Quintana-Mathé, Luke Horgan, Stefan D. McCabe, Nir Grinberg, Kenneth Joseph, and David Lazer | January 2025
Abstract: Social media play a pivotal role in disseminating web content, particularly during elections, yet our understanding of the association between demographic factors and political discourse online remains limited. Here, we introduce a unique dataset, DomainDemo, linking domains shared on Twitter (X) with the demographic characteristics of associated users, including age, gender, race, political affiliation, and geolocation, from 2011 to 2022. This new resource was derived from a panel of over 1.5 million Twitter users matched against their U.S. voter registration records, facilitating a better understanding of a decade of information flows on one of the most prominent social media platforms and trends in political and public discourse among registered U.S. voters from different sociodemographic groups. By aggregating user demographic information onto the domains, we derive five metrics that provide critical insights into over 129,000 websites. In particular, the localness and partisan audience metrics quantify the domains’ geographical reach and ideological orientation, respectively. These metrics show substantial agreement with existing classifications, suggesting the effectiveness and reliability of DomainDemo’s approach.
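To make the idea of aggregating user demographics onto domains concrete, the following is a minimal sketch on toy data. The record layout, the function name `partisan_audience_score`, and the scoring formula (the difference between unique Republican and unique Democrat sharers, divided by their sum) are assumptions chosen for illustration; they are not the definition used in DomainDemo, which the abstract does not specify.

```python
from collections import defaultdict

# Hypothetical toy records: (user_id, party, domain). Not real DomainDemo data.
shares = [
    ("u1", "Democrat", "example-news.com"),
    ("u2", "Republican", "example-news.com"),
    ("u3", "Democrat", "example-news.com"),
    ("u1", "Democrat", "example-news.com"),  # repeat share by the same user
    ("u4", "Republican", "example-blog.org"),
]


def partisan_audience_score(records):
    """Return {domain: score}, with each score in [-1, 1].

    Illustrative definition (an assumption, not the paper's formula):
    (unique Republican sharers - unique Democrat sharers) / (their sum).
    Negative values lean Democrat, positive values lean Republican.
    """
    users = defaultdict(lambda: {"Democrat": set(), "Republican": set()})
    for user_id, party, domain in records:
        # Users with any other affiliation are ignored in this sketch.
        if party in users[domain]:
            users[domain][party].add(user_id)

    scores = {}
    for domain, by_party in users.items():
        dem = len(by_party["Democrat"])
        rep = len(by_party["Republican"])
        if dem + rep:  # skip domains with no partisan sharers
            scores[domain] = (rep - dem) / (dem + rep)
    return scores


scores = partisan_audience_score(shares)
```

Counting unique users rather than raw shares keeps a single prolific account from dominating a domain's score; whether the dataset makes the same choice is not stated in the abstract.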