Description:
<h3>Overview</h3>
<p>
The <strong>Brazilian Political Protest Dataset (Annotated Tweets)</strong> is a collection of 5,000 manually labeled tweets related to the protests held in Brazil on September 7, 2021, and to the demonstrations that followed in the days after. The dataset captures public discourse on Twitter, including opinions, news, and media content shared by users who supported or opposed the protests.
</p>
<p>
We collected the tweets with a keyword-based approach, selecting terms that were trending in Brazil at the time. The manual labels are intended to support research in political discourse analysis, misinformation detection, and social media studies. Because of the location and context of the protests, most tweets are in Portuguese, with a small portion in English and Spanish.
</p>
<h3>Usage and Applications</h3>
<p>
This dataset may be useful for research in:
</p>
<ul>
<li><strong>Political Discourse Analysis</strong>: Understanding how different political groups interact online.</li>
<li><strong>Misinformation &amp; Fact-Checking</strong>: Analyzing fake news and manipulated media shared in the context of protests.</li>
<li><strong>Social Media Engagement &amp; Opinion Mining</strong>: Investigating sentiment and polarization.</li>
<li><strong>Multimodal AI Research</strong>: Studying how text, images, and news links contribute to online discourse.</li>
</ul>
<h3>Media Content</h3>
<p>
Due to the social networks' terms of use, we do not publicly release the collected texts and images. However, additional media content may be provided upon request to the authors.
</p>