Topic
Author
Publication year
Geography
Source
Study title
Study group
Collection year
Thematic collection
show more filters
(11) |
(8) |
(7) |
(5) |
(4) |
(2) |
(2) |
more... |
(4) |
more... |
to |
(5) |
(4) |
(3) |
(2) |
(1) |
(1) |
(1) |
to |
(3) |
(5) |
(4) |
(2) |
(1) |
(1) |
(1) |
(1) |
MethodologyMode of Data Collection
Sampling Procedure
Temporal Research Design
Analysis Unit
Kind of Data
More filtersInterview language
(2) |
Analysis Unit
Sort by:
GESIS Search
Find information about social science research data, publications on research data as well as open access publications.
Links between contents are displayed directly in the hit list. For example, you can find matching publications to the research data found.
The results can be filtered quickly and conveniently according to the following categories:
- Research data
- Variables from questionnaires
- Instruments and tools
- Literature
- Publications on research data and survey instruments
- Open-access Publications in the Social Sciences
- Literature on "Women in Science and Research"
- Collections of the GESIS library
- General information and offers on the GESIS websites
Contact: suche@gesis.org
The GESIS Library is a special library for Empirical Social Research and Applied Computer Science. It is available to external users as a reference library in Cologne and Mannheim.
The cooperation with the Research Data Center Education at the DIPF enables us to show you also hits on measurement instruments in the field of educational research. The measurement instruments documented there can be used free of charge for non-commercial purposes.
The cooperation with the Open Test Archive at the Leibniz Institute of Psychology (ZPID) enables us to show you also hits on measurement instruments in the field of psychology and related disciplines. The measurement instruments documented there are protected by copyright and are made available free of charge ("Open Access") for use in research, teaching and practice under a Creative Commons license.
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2860
Abstract: Tweetplomacy 23 is a semantically annotated corpus of tweets capturing digital communicative interaction between international political leaders, peer groups and citizens ... more
Abstract: Tweetplomacy 23 is a semantically annotated corpus of tweets capturing digital communicative interaction between international political leaders, peer groups and citizens ... more
Availability: Free access (without registration)
License: CC BY-NC 4.0: Attribution – NonCommercial (https://creativecommons.org/licenses/by-nc/4.0/deed.de)
Subject area: [Interdisciplinary and Applied Fields of the Social Sciences [=] Mass Communication [=] Information Science [=] Interpersonal Communication]
Topics: Ukraine, Russia, political communication, crisis communication, discourse, discourse analysis, international relations, international politics, epidemic, text communication, text processing, text analysis, social media, energy, vaccination, natural gas, energy supply, crude oil, climate, climate change, greenhouse effect, shortage, Federal Chancellor, career politician, international organization, OECD member country, political leadership, head of state, ministry of foreign affairs
Date(s) of Data Collection: 2018-01; 2023-05
Universe: 1% random sample Twitter/X archive
Notes: keywords- and user list-based extraction
Primärforschende, Institution: Petermann, Jan-Henrik; RedaktionsNetzwerk Deutschland (RND), Bensmann, Felix; GESIS - Leibniz-Institut für Sozialwissenschaften, Zhang, Yudong; GESIS - Leibniz-Institut für Sozialwissenschaften, Dimitrov, Dimitar; GESIS - Leibniz-Institut für Sozialwissenschaften
Publication year: 2025
DOI: 10.7802/2860
Study number: SDN-10.7802-2860
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2860
Publications: Dimitrov, D., Baran, E., Fafalios, F., Yu, R., Zhu, X., Zloch, M., and Dietze, S., TweetsCOV19 -- A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic, 29th ACM, International Conference on Information & Knowledge Management (CIKM2020), Resource Track, ACM 2020
Downloads
- Datasets
- Codebook
Purpose of use:
Downloads:
Downloads:
Tweetplomacy_-_List_of_Handles.xlsx
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | cdd904945157b0993ed733b677567cd2 |
---|---|
Type of file: | Research Data |
File size: | 201.25 KB |
Tweetplomacy_-_List_of_Keywords.xlsx
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 8c5cd821220576b8a3539e0b8db9ce75 |
---|---|
Type of file: | Research Data |
File size: | 10.5 KB |
tweetplomacy-private-23-DE.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 131fb31c72ab6e8c006a77ac96535ced |
---|---|
Type of file: | Research Data |
File size: | 9.1 MB |
tweetplomacy-private-23-EN.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | ecbc82c4185d7eeeb1818c087e0b0444 |
---|---|
Type of file: | Research Data |
File size: | 468.23 MB |
tweetplomacy-private-23-ES.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 3f24a74eb4807aa04f3f0df70505c96a |
---|---|
Type of file: | Research Data |
File size: | 160.06 MB |
tweetplomacy-private-23-FR.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 84d2d29d57eb5f0a52d44dcd7b686bf1 |
---|---|
Type of file: | Research Data |
File size: | 16.17 MB |
tweetplomacy-public-23-DE.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 276490296bd4d9397561a1349cd06cef |
---|---|
Type of file: | Research Data |
File size: | 820.07 KB |
tweetplomacy-public-23-EN.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 022971b56af2f35cdd2b67b3c0990542 |
---|---|
Type of file: | Research Data |
File size: | 33.47 MB |
tweetplomacy-public-23-ES.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 38879b82f67370f6294299dd5d72d56e |
---|---|
Type of file: | Research Data |
File size: | 8.26 MB |
tweetplomacy-public-23-FR.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | cdbacca438cb1767ddab73061c0e701e |
---|---|
Type of file: | Research Data |
File size: | 1.58 MB |
Downloads:
tweetplomacy_Codebook_jsonl-data.txt
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 3ab7dd98d02df4fee3c1eaf8f1cdf80d |
---|---|
Type of file: | Codebook |
File size: | 1.65 KB |
SWP - German Institute for International and Security Affairs. Data File Version 1.0.0, https://doi.org/10.7802/2768
Other Title (type): DIS-KEN (Alternative title)
Abstract: European actors are increasingly relying on strategic communication tools in their external relations, especially in key partner countries like Kenya. Based on a large-sc ... more
Abstract: European actors are increasingly relying on strategic communication tools in their external relations, especially in key partner countries like Kenya. Based on a large-sc ... more
Availability: Free access (without registration)
License: CC BY 4.0: Attribution (https://creativecommons.org/licenses/by/4.0/deed.de)
Subject area: [Political Science [=] Mass Communication]
Date(s) of Data Collection: 2013-01; 2023-07
Geographic coverage: Kenya / KE
Geographic coverage (free): [Ostafrika]
Temporal Research Design: Time series
Mode of Data Collection: Content Analysis
Notes: The embeddings used are based on the Model Deberta V3 Base, which can be found on huggingface: https://huggingface.co/microsoft/deberta-v3-baseFurther models trained for classification and then used are published on huggingface as well: https://huggingface.co/swp-berlin
Primärforschende, Institution: Eickhoff, Karoline; Stiftung Wissenschaft und Politik, Bochtler, Paul; Stiftung Wissenschaft und Politik
Publication year: 2024
DOI: 10.7802/2768
Study number: SDN-10.7802-2768
Contributor, Institution, Role: [Templin, Corinna; Stiftung Wissenschaft und Politik (Data Manager) [=] Sperk, Andrea; Stiftung Wissenschaft und Politik (Data Manager)]
Project funder: German Federal Foreign Office, Federal Ministry for Economic Cooperation and Development, Federal Ministry of Defence
Publisher: SWP - German Institute for International and Security Affairs
Current Version: 1.0.0, https://doi.org/10.7802/2768
Publications: Eickhoff, Karoline; Bochtler, Paul; Owilla, Hesbon Hansen. Communicating Strategically about What? Europe and China in the Kenyan Media. Megatrends Policy Brief (27). Stiftung Wissenschaft und Politik, Berlin., Stiftung Wissenschaft und Politik. (2024). deberta-base-news-topics-kenia-china (Revision 2240235)., Stiftung Wissenschaft und Politik. (2024). deberta-base-news-topics-kenia-europe (Revision 48ee195).
Downloads
- Datasets
- Syntax file
- Methods report
Purpose of use:
Downloads:
Downloads:
dis-ken_embeddings.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f667665a70859ce3d3ff3ced0227464e |
---|---|
Type of file: | Research Data |
File size: | 100.43 MB |
Version number: | 1.0 |
Version date: | 2024-09-16 |
Language: | English / en |
Number of variables: | 773 |
Number of units: | 11691 |
Source | News Aggregator |
Additional information about the file | This comma-separated file contains the embeddings of the news articles based on Deberta (https://huggingface.co/microsoft/deberta-v3-base) |
Purpose of use:
Downloads:
Downloads:
dis-ken_supervised_classification.py
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 2ed5569b560321918bcf7da5261aede1 |
---|---|
Type of file: | Syntax |
File size: | 8 KB |
Version number: | 1.0 |
Version date: | 2024-09-16 |
Language: | English / en |
Software and Version | python > 3.10 |
Additional information about the file | This is the code for the supervised classification. This only includes the code for the case of Europe, but the code for China is the exact same, just with different data files. |
dis-ken_unsupervised_topic_modelling.py
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 716479fb2082ca08db78ded88f469bec |
---|---|
Type of file: | Syntax |
File size: | 6.27 KB |
Version number: | 1.0 |
Version date: | 2024-09-16 |
Language: | English / en |
Software and Version | python > 3.10 |
Additional information about the file | This is the code for the unsupervised classification. This only includes the code for the case of Europe, but the code for China is the exact same, just with different data files. |
Downloads:
dis-ken_method_report.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 7f7741c74c77d2027c8fd143700d7b22 |
---|---|
Type of file: | Methods Report |
File size: | 559.68 KB |
Version number: | 1.0 |
Version date: | 2024-09-16 |
Language: | English / en |
Additional information about the file | This is the method report with the narrative description of the used methods. |
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2824
Abstract: Parliaments are key institutions of democracy. The documents that parliaments produce - including plenary protocols, legislative bills, and ultimately adopted laws - thu ... more
Abstract: Parliaments are key institutions of democracy. The documents that parliaments produce - including plenary protocols, legislative bills, and ultimately adopted laws - thu ... more
Availability: Free access (without registration)
License: CC BY 4.0: Attribution (https://creativecommons.org/licenses/by/4.0/deed.de)
Subject area: [Political Science [=] Science of Communication [=] Science of Literature, Linguistics]
Topics: parliamentary debate, democracy, political communication, text analysis, bill, party politics
Geographic coverage: [Austria / AT [=] Czech Republic / CZ [=] Croatia / HR [=] Denmark / DK [=] Germany / DE [=] Hungary / HU [=] Spain / ES]
Geographic coverage (free): [European Union]
Sampling Procedure: Total Universe / Complete enumeration
Notes: Downloading the data packages can take a long time due to their size (approx. 200 MB - 700 MB), depending on your internet connection. Please note that no progress bar is displayed during the download.
Each zip file contains three separate .rds files per country. One for bills, one for laws, and one for plenary speeches. More information about the data and the project can be found in the codebook and on our website: https://parllawspeech.org/
The PLS corpora have been created under OPTED Work Package 5 (led by Sven-Oliver Proksch, Christian Rauh, and Miklós Sebők) and funded by the European Union’s Horizon 2020 program (Grant agreement 951832).
Each zip file contains three separate .rds files per country. One for bills, one for laws, and one for plenary speeches. More information about the data and the project can be found in the codebook and on our website: https://parllawspeech.org/
The PLS corpora have been created under OPTED Work Package 5 (led by Sven-Oliver Proksch, Christian Rauh, and Miklós Sebők) and funded by the European Union’s Horizon 2020 program (Grant agreement 951832).
Primärforschende, Institution: Schwalbach, Jan; GESIS – Leibniz Institute for the Social Sciences, Hetzer, Lukas; Unversity of Cologne, Proksch, Sven-Oliver; University of Cologne, Rauh, Christian; WZB - Berlin Social Science Center, Sebők, Miklós; Centre for Social Sciences, Budapest
Publication year: 2025
DOI: 10.7802/2824
Study number: SDN-10.7802-2824
Project funder: European Union’s Horizon 2020 program (Grant agreement 951832)
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2824
Downloads
- Datasets
- Codebook
Purpose of use:
Downloads:
Downloads:
Corpora_PLS_EP.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0a28e248a4542ad047d75067fb19d1b8 |
---|---|
Type of file: | Research Data |
File size: | 695.99 MB |
Source | https://eur-lex.europa.eu/homepage.html?locale=en,https://www.europarl.europa.eu/portal/en |
Corpora_PLS_austria.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | b5ff44870b7efee84585ff7ed39b4379 |
---|---|
Type of file: | Research Data |
File size: | 359.18 MB |
Source | https://www.parlament.gv.at/,https://www.ris.bka.gv.at/ |
Corpora_PLS_croatia.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | e4b604b7bf09d8c822bb6c7965d141aa |
---|---|
Type of file: | Research Data |
File size: | 329.12 MB |
Source | https://edoc.sabor.hr/ |
Corpora_PLS_czech_republic.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 29c04d9842b9e55733e4f7a1892febd7 |
---|---|
Type of file: | Research Data |
File size: | 202.52 MB |
Source | https://www.aspi.cz,https://www.psp.cz |
Corpora_PLS_denmark.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | bca48378c0e9e54a6fc95b3bd6b3aafe |
---|---|
Type of file: | Research Data |
File size: | 325.82 MB |
Source | https://www.retsinformation.dk/,https://www.ft.dk |
Corpora_PLS_germany.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | b75f1779135a53cb6e60b1f0544d0202 |
---|---|
Type of file: | Research Data |
File size: | 340.06 MB |
Source | https://dip.bundestag.de/,https://www.bgbl.de |
Corpora_PLS_hungary.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | fd2282e1182b47566e21ce56ea411945 |
---|---|
Type of file: | Research Data |
File size: | 725.56 MB |
Source | https://www.parlament.hu |
Corpora_PLS_spain.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 01d91a1144de1f97c242bd0661e2289d |
---|---|
Type of file: | Research Data |
File size: | 385.42 MB |
Source | https://www.congreso.es/es/,https://www.senado.es/web/index.html |
Downloads:
Codebook_ParlLawSpeech.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 7b08740ccc966096b8fd5c51bc4d03f5 |
---|---|
Type of file: | Codebook |
File size: | 490.17 KB |
Schellhammer, Sebastian; Baran, Erdal; Bensmann, FelixDimitrov, Dr. Dimitar; Dietze, Stefan; Zhang, Yudong
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2781
Abstract: TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 3.1 billion tweets, spann ... more
Abstract: TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 3.1 billion tweets, spann ... more
Availability: Free access (without registration)
License: Data can only be used for non-commercial research
Date(s) of Data Collection: 2022-09; 2023-06
Universe: a 1% sample of all tweets from Sep 2022 unti Jun 2023
Notes: The dataset consists of english and spam-filtered tweets from a 1% sample of all tweets from Sep 2022 until June 2023.
Primärforschende, Institution: Schellhammer, Sebastian; GESIS - Leibniz-Institut für Sozialwissenschaften, Baran, Erdal; GESIS - Leibniz-Institut für Sozialwissenschaften, Bensmann, Felix; GESIS - Leibniz-Institut für Sozialwissenschaften, Dimitrov, Dr. Dimitar; GESIS - Leibniz-Institut für Sozialwissenschaften, Dietze, Stefan; GESIS - Leibniz-Institut für Sozialwissenschaften & Heinrich-Heine-University Düsseldorf, Germany & L3S Research Center, Hannover, Germany, Zhang, Yudong; GESIS - Leibniz-Institut für Sozialwissenschaften
Publication year: 2024
DOI: 10.7802/2781
Study number: SDN-10.7802-2781
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2781
Publications: P. Fafalios, V. Iosifidis, E. Ntoutsi, and S. Dietze, TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets, 15th Extended Semantic Web Conference (ESWC'18), Heraklion, Crete, Greece, June 3-7, 2018.
Downloads
- Datasets
Purpose of use:
Downloads:
Downloads:
month_2022-09.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | e9f1fb2df70f528e84d1b12a69ba2715 |
---|---|
Type of file: | Research Data |
File size: | 3.76 GB |
month_2022-10.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | ea1c996b1022d734d427bdab65015dde |
---|---|
Type of file: | Research Data |
File size: | 3.96 GB |
month_2022-11.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 2ce9f2be48d762d488b7598cec770585 |
---|---|
Type of file: | Research Data |
File size: | 4 GB |
month_2022-12.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0ef024aea82a77ee922157b451ac48e4 |
---|---|
Type of file: | Research Data |
File size: | 3.78 GB |
month_2023-01.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0a1952d1f041851e8ceb5f22bc18c1a5 |
---|---|
Type of file: | Research Data |
File size: | 3.55 GB |
month_2023-02.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 3977fcc6d4a14d88fe2fdfd1f45932f6 |
---|---|
Type of file: | Research Data |
File size: | 3.28 GB |
month_2023-03.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | e6c78f10baff1b0dd84a6e3900a7b6f4 |
---|---|
Type of file: | Research Data |
File size: | 3.83 GB |
month_2023-04.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f0f5d46fd08298004ee400867dbcc919 |
---|---|
Type of file: | Research Data |
File size: | 3.59 GB |
month_2023-05.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f7a34f2bf1b7c997a091103afe72b8cd |
---|---|
Type of file: | Research Data |
File size: | 3.66 GB |
month_2023-06.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f697d3d29443b48fb34054ece42a5659 |
---|---|
Type of file: | Research Data |
File size: | 1.56 GB |
GESIS, Cologne. Data File Version 2.0.0, https://doi.org/10.7802/2847
Other Title (type): An archive of Twitter/X's policies for Tweet redistribution 2006-2023 (Original title)
Abstract: When researchers publish results based on the analysis of Tweets, good practice requires sharing the Tweets (or Tweet IDs) to enable reproducibility of the results. What ... more
Abstract: When researchers publish results based on the analysis of Tweets, good practice requires sharing the Tweets (or Tweet IDs) to enable reproducibility of the results. What ... more
Availability: Free access (without registration)
Subject area: [Information Science]
Date(s) of Data Collection: 2024-07; 2024-08
Mode of Data Collection: Compilation/Synthesis
Primärforschende, Institution: Golland, Luisa; GESIS - Leibniz-Institute for the Social Sciences, Recker, Jonas; GESIS - Leibniz-Institute for the Social Sciences, Schwalbach, Jan; GESIS - Leibniz-Institute for the Social Sciences
Publication year: 2025
DOI: 10.7802/2847
Study number: SDN-10.7802-2847
Contributor, Institution, Role: [Bishop, Elizabeth; GESIS - Leibniz-Institute for the Social Sciences (Project Member) [=] Watteler, Oliver; GESIS - Leibniz-Institute for the Social Sciences (Project Member)]
Publisher: GESIS, Cologne
Current Version: 2.0.0, https://doi.org/10.7802/2847
Version history:
Version number | Changes in this version |
---|---|
2.0.0 (aktuelle Version) | 2025-02-12 An RDS corpus created from the .html files has been added along with the R script used to create this corpus. A "restriction_score" and scores for the number of objects allowed to be reditributed was added to "TwitterX_policies_metadata.csv". The file "TwitterX_policies_restrict.csv" was added. The Methods Report was updated. The reference information in the readme file and the scripts in "TwitterX_policies_python.zip" was updated. https://doi.org/10.7802/2847 |
1.0.0 | 2024-09-16 https://doi.org/10.7802/2761 |
Downloads
- Datasets
- Syntax file
- Methods report
Purpose of use:
Downloads:
Downloads:
TwitterX_policies_htmls.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 087446d8c3ff5f2890d400366ba9a9a2 |
---|---|
Type of file: | Research Data |
File size: | 1.65 MB |
Number of units: | 32 |
Source | The Internet Archive / Wayback Machine (https://web.archive.org/) |
Additional information about the file | This archive contains 32 .html files. |
TwitterX_policies_metadata_v2-0-0.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | a65116a7bc4494a8e0c60e7326ff5748 |
---|---|
Type of file: | Research Data |
File size: | 39.45 KB |
Version number: | 2.0.0 |
Version date: | 2025-02-26 |
Additional information about the file | Tab-separated; this .csv contains the Wayback Machine snapshot URLs for the downloaded .html files, and additional information about each document archived. The file also serves as input file for the Python scripts. A codebook is published in TwitterX_Policies_Methods_Report_V2-0-0.pdf. |
TwitterX_policies_restrict.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 9d33046b5249f01f54a27313a5309808 |
---|---|
Type of file: | Research Data |
File size: | 6.59 KB |
Version number: | 1.0.0 |
Version date: | 2025-02-25 |
Additional information about the file | tab-separated; contains schema for calculating the restriction score for each version of the regulations for the redistribution of platform content described in 'TwitterX_policies_metadata_v2-0-0.csv' |
TwitterX_tos_corpus.RDS
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 20313bf98042e080d37a062676b07558 |
---|---|
Type of file: | Research Data |
File size: | 138.71 KB |
Software and Version | R 4.4.2 |
Additional information about the file | The corpus created with the R script TwitterX_tos_corpus.R. |
Purpose of use:
Downloads:
Downloads:
TwitterX_policies_python_v2-0-0.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 5c30b7b82490db58f75a6bf03a8902cc |
---|---|
Type of file: | Syntax |
File size: | 3.3 KB |
Software and Version | Python 3.12.0 |
Additional information about the file | This archive contains three .py files that can be used to replicate the creation of the .html files released here. |
TwitterX_tos_corpus.R
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | fe56ddde37d09bc266bd44e03f209c10 |
---|---|
Type of file: | Syntax |
File size: | 9.63 KB |
Software and Version | R 4.4.2 |
Additional information about the file | The R script to read in the relevant text parts of the html files and turning them into a corpus with the respective metadata. The script was created and run using R 4.4.2 using the latest version (10th of February 2025) of the following packages: tidyverse, rvest, SentimentAnalysis, quanteda.textstats, stringdist, textreuse, lubridate, ggplot2. |
Downloads:
TwitterX_Policies_Methods_Report_V2-0-0.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f912ac57026e52b3414edbef751ec936 |
---|---|
Type of file: | Methods Report |
File size: | 229.29 KB |
Version number: | 2.0.0 |
Version date: | 2025-02-25 |
Language: | English / en |
SWP - German Institute for International and Security Affairs. Data File Version 1.0.0, https://doi.org/10.7802/2691
Other Title (type): iran_israel_maghreb (Alternative title)
Abstract: The research data described below was collected as part of the SWP study "Friends and Foes: the instrumentalisation of Israel and Iran in the Maghreb". The period of qual ... more
Abstract: The research data described below was collected as part of the SWP study "Friends and Foes: the instrumentalisation of Israel and Iran in the Maghreb". The period of qual ... more
Availability: Free access (without registration)
Subject area: [Peace and Conflict Research, International Conflicts, Security Policy]
Topics: Iran, Israel, authoritarianism, regional factors, conflict potential, Middle East conflict, social media, news agency, text analysis, human rights, freedom of opinion, Maghreb region, propaganda, instrumentalization, twitter
Date(s) of Data Collection: 2020-01-01; 2022-06-01
Geographic coverage: [Israel / IL [=] Algeria / DZ [=] Morocco / MA [=] Iran (Islamic Republic of) / IR [=] Tunisia / TN]
Geographic coverage (free): [Maghreb [=] Nordafrika [=] Nahost]
Universe: Alle Nachrichtentexte der Nachrichtenagenturen; Alle Tweets
Sampling Procedure: Non-probability Sample - Purposive Sample
Temporal Research Design: Time series
Mode of Data Collection: Content Analysis
Primärforschende, Institution: Werenfels, Isabelle; Stiftung Wissenschaft und Politik, Bochtler, Paul; Stiftung Wissenschaft und Politik
Publication year: 2024
DOI: 10.7802/2691
Study number: SDN-10.7802-2691
Contributor, Institution, Role: [Mourad, Pelican; Stiftung Wissenschaft und Politik (Related Person) [=] Bousnina, Amina (Related Person)]
Publisher: SWP - German Institute for International and Security Affairs
Current Version: 1.0.0, https://doi.org/10.7802/2691
Downloads
- Datasets
- Methods report
Purpose of use:
Downloads:
Downloads:
iran_israel_maghreb-news_text_manual_codes.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 91f981b705430c411b01d71c3e8cd1cf |
---|---|
Type of file: | Research Data |
File size: | 59.42 KB |
Version number: | 1.0 |
Version date: | 2024-02-27 |
Language: | French / fr |
Number of variables: | 6 |
Number of units: | 429 |
Source | Algérie Presse Service (APS),Maghreb Arab Presse (MAP) |
Additional information about the file | Sentiment Codes of Newstexts from News Agencies in Morocco and Algeria published between 01.01.2020 and 01.06.2022. |
iran_israel_maghreb-tweets_hashtags.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | b269e5e70692f30758f77d2ece673412 |
---|---|
Type of file: | Research Data |
File size: | 26.82 MB |
Version number: | 1.0 |
Version date: | 2024-02-27 |
Language: | Arabic / ar |
Number of variables: | 2 |
Number of units: | 146983 |
Source | |
Additional information about the file | Sample of tweets analyzed to look at the location and importance of hashtags in the discourse around the normalisation of relations with Israel. |
iran_israel_maghreb-tweets_influencer.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 75385993c8fac64769a9398e194d80cc |
---|---|
Type of file: | Research Data |
File size: | 1.15 MB |
Version number: | 1.0 |
Version date: | 2024-02-27 |
Language: | Arabic / ar |
Number of variables: | 1 |
Number of units: | 43624 |
Source | |
Additional information about the file | Sample of tweets that had relevant keywords concerning regional players from a qualitatively selected sample of important influencers in the Maghreb region. |
iran_israel_maghreb-tweets_sentiment_iran.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0a23685e36913120e858d8a7a4568f7d |
---|---|
Type of file: | Research Data |
File size: | 12.14 KB |
Language: | Arabic / ar |
Number of variables: | 1 |
Number of units: | 621 |
Source | |
Additional information about the file | Sample of tweets used for sentiment classification of stance towards Iran |
iran_israel_maghreb-tweets_sentiment_israel_normalisierung.csv
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | f1732bec9e1839e86e27b73cc1c96f31 |
---|---|
Type of file: | Research Data |
File size: | 10.1 KB |
Version number: | 1.0 |
Version date: | 2024-02-27 |
Language: | Arabic / ar |
Number of variables: | 2 |
Number of units: | 516 |
Source | |
Additional information about the file | Sample of tweets used for sentiment classification of stance towards a normalization with Israel |
Downloads:
iran_israel_maghreb-method_report.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 6b30e7553b3fb2d81bb394c82d5359f3 |
---|---|
Type of file: | Methods Report |
File size: | 387.16 KB |
Version number: | 1.0 |
Version date: | 2024-02-27 |
Language: | English / en |
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2744
Other Title (type): Text Corpus of Speeches in German State Parliaments (Untertitel)
Abstract: StateParl includes the speeches in all 16 German state parliaments between 2000 and 2022. The database consists of 9,531,215 paragraphs and 345,068,110 words. Stenographi ... more
Abstract: StateParl includes the speeches in all 16 German state parliaments between 2000 and 2022. The database consists of 9,531,215 paragraphs and 345,068,110 words. Stenographi ... more
Availability: Restricted Access
Subject area: [Politikwissenschaft [=] Sozialwissenschaften [=] Kommunikationswissenschaften [=] Staat, staatliche Organisationsformen [=] politische Willensbildung, politische Soziologie, politische Kultur [=] Kommunikationssoziologie, Sprachsoziologie, Soziolinguistik]
Topics: Landtag, federalism, parliament, parliamentary debate, party system, multi-party system, coalition, coalition policy, content analysis, text analysis, political communication, discourse analysis
Date(s) of Data Collection: 2023-04; 2023-10
Geographic coverage: Germany / DE
Geographic coverage (free): [Baden-Württemberg [=] Bayern [=] Berlin [=] Brandenburg [=] Bremen [=] Hamburg [=] Hessen [=] Mecklenburg-Vorpommern [=] Niedersachsen [=] Nordrhein-Westfalen [=] Rheinland-Pfalz [=] Saarland [=] Sachsen [=] Sachsen-Anhalt [=] Schleswig-Holstein [=] Thüringen]
Universe: Speeches in the 16 German state parliaments between 2000 and 2022
Sampling Procedure: Total Universe / Complete enumeration
Primärforschende, Institution: Beltermann, Eric; Freie Universität Berlin, Souris, Antonios; Freie Universität Berlin, Nguyen, Christoph; Freie Universität Berlin, Kropp, Sabine; Freie Universität Berlin
Publication year: 2024
DOI: 10.7802/2744
Study number: SDN-10.7802-2744
Project funder: VolkswagenStiftung, Freie Universität Berlin
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2744
Version history:
Version number | Changes in this version |
---|---|
2.0.0 (aktuelle Version) | 2025-03-03
|
1.0.0 | 2024-07-31 https://doi.org/10.7802/2744 |
Downloads
- Datasets
- Codebook
- Project report
Note:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Purpose of use:
Downloads:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Purpose of use:
Downloads:
stateparl-2.3.0-beta_csv.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 88984a2568a8980bd18a7b9f3a1088bf |
---|---|
Type of file: | Research Data |
File size: | 850.75 MB |
Version number: | 2.3.0 |
Version date: | 2024-07-15 |
Language: | German / de,English / en |
Number of variables: | 9 |
Number of units: | 9534937 |
Source | Stenographic protocols of the state parliaments |
stateparl-2.3.0-beta_rds.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 19117f7399c3a58e28a92be08011209b |
---|---|
Type of file: | Research Data |
File size: | 788.5 MB |
Version number: | 2.3.0 |
Version date: | 2024-07-15 |
Language: | German / de,English / en |
Number of variables: | 9 |
Number of units: | 9534937 |
Source | Stenographic protocols of the state parliaments |
stateparl-2.3.0-beta_xml.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 032be1f955a0777194a834e1a2aecaab |
---|---|
Type of file: | Research Data |
File size: | 950.98 MB |
Version number: | 2.3.0 |
Version date: | 2024-07-15 |
Language: | German / de,English / en |
Number of variables: | 9 |
Number of units: | 9534937 |
Source | Stenographic protocols of the state parliaments |
Note:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Downloads:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Downloads:
20240729_StateParl_codebook_final.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 1fe99b627f140997294f23200d30223b |
---|---|
Type of file: | Codebook |
File size: | 178.97 KB |
Version date: | 2024-07-29 |
Language: | English / en |
Note:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Downloads:
Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.
Please request access by email: Request access.
As soon as your request has been approved, you can log in and download the data here.
Downloads:
20240729_StateParl_documentation_and_codebook_final.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | a38bc91a26113828a5cca3ae38732ec3 |
---|---|
Type of file: | Project Report |
File size: | 1004.89 KB |
Version date: | 2024-07-29 |
Language: | English / en |
GESIS, Cologne. Data File Version 2.0.0, https://doi.org/10.7802/2854
Other Title (type): Text Corpus of Speeches in German State Parliaments (Subtitle)
Abstract: StateParl contains the parliamentary speeches of members of parliament and government representatives in all 16 German state parliaments from 1.1.2000 to 31.12.2023. Sten ... more
Abstract: StateParl contains the parliamentary speeches of members of parliament and government representatives in all 16 German state parliaments from 1.1.2000 to 31.12.2023. Sten ... more
Availability: Free access (with registration)
Subject area: [Political Science [=] Social Sciences [=] Science of Communication [=] Political System, Constitution, Government [=] Political Process, Elections, Political Sociology, Political Culture [=] Sociology of Communication, Sociology of Language, Sociolinguistics]
Topics: Landtag, federalism, parliament, parliamentary debate, party system, multi-party system, coalition, coalition policy, content analysis, text analysis, political communication, discourse analysis
Date(s) of Data Collection: 2023-04; 2024-12
Geographic coverage: [Germany / DE]
Geographic coverage (free): [Baden-Württemberg [=] Bayern [=] Berlin [=] Brandenburg [=] Bremen [=] Hamburg [=] Hessen [=] Mecklenburg-Vorpommern [=] Niedersachsen [=] Nordrhein-Westfalen [=] Rheinland-Pfalz [=] Saarland [=] Sachsen [=] Sachsen-Anhalt [=] Schleswig-Holstein [=] Thüringen]
Universe: Speeches in the 16 German state parliaments between 2000 and 2023
Sampling Procedure: Total Universe / Complete enumeration
Primärforschende, Institution: Beltermann, Eric; Freie Universität Berlin, Souris, Antonios; Freie Universität Berlin, Nguyen, Christoph; Freie Universität Berlin, Kropp, Sabine; Freie Universität Berlin
Publication year: 2025
DOI: 10.7802/2854
Study number: SDN-10.7802-2854
Project funder: VolkswagenStiftung, Freie Universität Berlin
Publisher: GESIS, Cologne
Current Version: 2.0.0, https://doi.org/10.7802/2854
Version history:
Version number | Changes in this version |
---|---|
2.0.0 (aktuelle Version) | 2025-03-03
|
1.0.0 | 2024-07-31 https://doi.org/10.7802/2744 |
Downloads
- Datasets
- Project report
Purpose of use:
The download of these files requires a login at GESIS. Registration at GESIS is free of charge, open to all and gives you access to various GESIS services.
Downloads:
The download of these files requires a login at GESIS. Registration at GESIS is free of charge, open to all and gives you access to various GESIS services.
Downloads:
stateparl_csv.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | d8e82b1c230126ba67b0572d1611f548 |
---|---|
Type of file: | Research Data |
File size: | 1.14 GB |
Version number: | 2.0 |
Version date: | 2025-03-04 |
Language: | German / de,English / en |
Source | Stenographic protocols of the state parliaments |
Additional information about the file | The zip-file contains three comma-separated datasets: - mandateMappings.csv (Number of units: 10,179; Number of Variables: 2) - paragraphs.csv (Number of units: 14,092,664; Number of Variables: 12) - protocols.csv (Number of units: 8,268; Number of Variables: 6) |
stateparl_rds.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | d2fd1091de4e0d0ebe06c8ebca9c1cc6 |
---|---|
Type of file: | Research Data |
File size: | 1.03 GB |
Version number: | 2.0 |
Version date: | 2025-03-04 |
Language: | German / de,English / en |
Source | Stenographic protocols of the state parliaments |
Additional information about the file | The zip-file contains three datasets: - mandateMappings.rds (Number of units: 10,179; Number of Variables: 2) - paragraphs.rds (Number of units: 14,092,664; Number of Variables: 12) - protocols.rds (Number of units: 8,268; Number of Variables: 6) |
The download of these files requires a login at GESIS. Registration at GESIS is free of charge, open to all and gives you access to various GESIS services.
Downloads:
Downloads:
20250211_StateParl_documentation_and_codebook_release-candidate_final.pdf
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | fe9204b54522162b35d167d76b117503 |
---|---|
Type of file: | Project Report |
File size: | 1.04 MB |
Language: | English / en |
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2825
Abstract: TeleScope is an extensive dataset suite that comprises metadata for about 500K Telegram channels and downloaded message metadata from all 71K public channels within this ... more
Abstract: TeleScope is an extensive dataset suite that comprises metadata for about 500K Telegram channels and downloaded message metadata from all 71K public channels within this ... more
Availability: Free access (without registration)
Subject area: [Computational Social Science [=] Social Sciences [=] Other Fields of the Applied Social Sciences [=] Information Science]
Date(s) of Data Collection: 2024-02; 2024-10
Universe: Initial selection is based on top 100 TGStat channels according to Subscribers, Reach and Citation Criteria. The growth of the dataset is based on snowball sampling.
Sampling Procedure: The growth of the dataset is based on snowball sampling
Temporal Research Design: Longitudinal (panel study)
Primärforschende, Institution: Gangopadhyay, Susmita; GESIS – Leibniz Institute for the Social Sciences, Dessi, Danilo; University of Sharjah, Dimitrov, Dimitar; GESIS – Leibniz Institute for the Social Sciences, Dietze, Stefan; Heinrich Heine University Düsseldorf
Publication year: 2025
DOI: 10.7802/2825
Study number: SDN-10.7802-2825
Contributor, Institution, Role: [Dessi, Danilo; University of Sharjah (Contact Person) [=] Dimitrov, Dimitar; GESIS – Leibniz Institute for the Social Sciences, (Contact Person) [=] Dietze, Stefan; Heinrich Heine University Düsseldorf (Contact Person)]
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2825
Downloads
- Datasets
- Technical report
Purpose of use:
Downloads:
Downloads:
1000015666_to_1054523330.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | cb07fc5d4d6da0b12a20378fcc95e244 |
---|---|
Type of file: | Research Data |
File size: | 2.97 GB |
1054549314_to_1092663584.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | a2f4e53b8b2451a5c940d8d577ac061e |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1092691131_to_1124443753.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 7d449c7835783976793dec38966ba804 |
---|---|
Type of file: | Research Data |
File size: | 2.95 GB |
1124469578_to_1147333336.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0f94da045401b6027111d94ec40740ff |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1147336903_to_1180665618.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | c63bc922cfd53916c5c31fd38b71c4b1 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1180680018_to_1219762447.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | da701e80b855a55d84e9d01666bb7314 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1219766382_to_1255194806.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 79b78093927812d0020536de298cb7ed |
---|---|
Type of file: | Research Data |
File size: | 2.99 GB |
1255195746_to_1291104162.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 8f19349cf3e568629a79c6e0078c91d3 |
---|---|
Type of file: | Research Data |
File size: | 2.99 GB |
1291116365_to_1325779356.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 7c6f55ea796d82b26b068bd59f785d4a |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1325786124_to_1364416179.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 7b52b9a32f79b96dba6a6b21592b0393 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1364421816_to_1397037295.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 047ad133adac1b5ad9fe43e676937ea9 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1397113490_to_1436310453.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 984edf97d42dca34d1b2c6090f174251 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1436320749_to_1469586346.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | fa702496cc165927932374538739c00b |
---|---|
Type of file: | Research Data |
File size: | 2.99 GB |
1469592192_to_1508140759.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0e0f3af516c5e3a6da6f607e70fbd239 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1508149386_to_1568817860.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 13e9602054fcacf7ddb0eca731470602 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1568821876_to_1625726066.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | a89b11236a8c5e874db2841d846e6be8 |
---|---|
Type of file: | Research Data |
File size: | 2.91 GB |
1625754907_to_1677378798.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 4d544143c8124ff5bd6b55c49152022a |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1677440310_to_1731205486.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 03f299bd29f922001321eccf037a5f14 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1731206105_to_1762981965.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 37d83097e3f816a4e71e11c128316858 |
---|---|
Type of file: | Research Data |
File size: | 2.99 GB |
1762989759_to_1778345773.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 3d3f51dbf23bc8be1a25403dec9a986e |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1778346527_to_1792456646.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 31163fd4bd477d0815a41c26fab0374a |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1792460600_to_1826619557.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | c8cb5235beb1f51ab147a7fbc3a69e8d |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1826634808_to_1877649970.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0d5730f67bb91425bae891528f9829e3 |
---|---|
Type of file: | Research Data |
File size: | 3 GB |
1877677892_to_1961731638.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | d16983cc2f1ec421b4dbf7dc288b2fb3 |
---|---|
Type of file: | Research Data |
File size: | 2.99 GB |
1961746451_to_2174838182.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | ce65865403766e0dce4c498fabe43163 |
---|---|
Type of file: | Research Data |
File size: | 1.97 GB |
Channel_Interaction_Data_CID__v1-0-0.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 1b5a4476457db7fe65ff84658891a0a0 |
---|---|
Type of file: | Research Data |
File size: | 645.95 MB |
Additional information about the file | This folder describes message propagation and interaction dynamics between channels. It contains structured information about user interactions, supporting comprehensive analyses of channel behavior and interactions. This folder contains three files: Channel_to_Channel_Graph: Illustrates how Telegram channels are connected through message forwarding. Channels are represented as nodes, and each forwarding event is depicted as a directed edge. The file includes the source channel ID, destination channel ID, and a list of message IDs forwarded between the two channels. A combination of the source channel ID and the message ID allows retrieval of the message's contents. Message_Forwarding_Flows: Provides a detailed record of the complete path a message takes as it is forwarded across channels. It includes: Connected Component ID: A unique identifier for the propagation of a specific message. Connected Component Size: The total length of the message's propagation, indicating its reach. Connected Component Nodes: A detailed sequence combining channel IDs and message IDs, representing the entire journey of the message. As a message traverses the network, it moves through various channels, each associated with distinct channel IDs and message IDs. This file comprehensively maps the entire propagation path of the message across the Telegram network. Aggregated_User_Interactions: Provides an aggregated view of user interactions, such as reactions, forwards, and views associated with a message across the entire network. Each record in this file is linked to the Message Forwarding Flows through the connected component ID, ensuring that the data corresponds to a specific message as it propagates through the Telegram network. |
Channel_Metadata_and_Enrichments_v1-0-0.zip
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 87c265af1ad04f73620bb1e81c1a3b17 |
---|---|
Type of file: | Research Data |
File size: | 34.69 MB |
Version number: | 1 |
Version date: | 2025-01-14 |
Additional information about the file | This folder contains three files: Source Channel Metadata: The largest collection of Telegram channel metadata, comprising 534,137 channels discovered through a snowball sampling approach. This dataset serves as a comprehensive registry for selecting channels tailored to specific research needs. Channel_Language: Contains the primary language detected for 71K public channels within the 534,137 channels, using the Python Langdetect library (Langdetect on PyPI). Temporal Channel Metadata: Includes information about the distribution of messages across daily hours within a single channel, enabling analysis of the most active time intervals for specific channels. |
TeleScope_seedlist_v1-0-0.xls
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | ef01db29b9e47bd60b57eca8c41abeb3 |
---|---|
Type of file: | Research Data |
File size: | 36.5 KB |
Additional information about the file | In addition to the data suite, we provide seedlists comprising 251 unique channel IDs sourced from the top 100 channels based on citation, reach, and subscriber categories, as listed on the TGStat website. It is important to note that while we initially gathered data from the top 100 channels, some IDs could not be retrieved or resolved. This may be due to changes in channel usernames or the deletion of entire channels during the extraction period. |
Downloads:
TeleScope_readme_v1-0-0.txt
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 90bc044b4aa118224fcca356c2da837a |
---|---|
Type of file: | Technical Report |
File size: | 4.62 KB |
Version number: | 1 |
Version date: | 2025-01-21 |
GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2472
Abstract: TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning ... more
Abstract: TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning ... more
Availability: Free access (without registration)
License: Data can only be used for non-commercial research
Date(s) of Data Collection: 2021-01; 2021-12
Universe: a 1% sample of all tweets from Jan 2021 until Dec 2021
Notes: The dataset consists of english and spam-filtered tweets from a 1% sample of all tweets from Jan 2021 until Dec 2021.
Primärforschende, Institution: Baran, Erdal; GESIS - Leibniz-Institut für Sozialwissenschaften, Bensmann, Felix; GESIS - Leibniz-Institut für Sozialwissenschaften, Dietze, Stefan; GESIS - Leibniz-Institut für Sozialwissenschaften & Heinrich-Heine-University Düsseldorf, Germany & L3S Research Center, Hannover, Germany
Publication year: 2022
DOI: 10.7802/2472
Study number: SDN-10.7802-2472
Publisher: GESIS, Cologne
Current Version: 1.0.0, https://doi.org/10.7802/2472
Publications: P. Fafalios, V. Iosifidis, E. Ntoutsi, and S. Dietze,
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets,
15th Extended Semantic Web Conference (ESWC'18), Heraklion, Crete, Greece, June 3-7, 2018.
Downloads
- Datasets
Purpose of use:
Downloads:
Downloads:
month_2021-01.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | af4ba51d9b980587913151812a021d3a |
---|---|
Type of file: | Research Data |
File size: | 4.17 GB |
month_2021-02.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 818c33cff76b35a61b9a80f642c360a6 |
---|---|
Type of file: | Research Data |
File size: | 3.74 GB |
month_2021-03.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 994d2a098ba84a69556ae4bce63a79d7 |
---|---|
Type of file: | Research Data |
File size: | 4.05 GB |
month_2021-04.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | fe434a3523217df66ca622317ee6f58c |
---|---|
Type of file: | Research Data |
File size: | 3.82 GB |
month_2021-05.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 82cc41a1399201bbb3dc8db1fc1db71f |
---|---|
Type of file: | Research Data |
File size: | 4.04 GB |
month_2021-06.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | c2138b21e12a473d2cc5f3b0dfc42488 |
---|---|
Type of file: | Research Data |
File size: | 3.72 GB |
month_2021-07.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | b8991fe081bdefbc81641942f2e47214 |
---|---|
Type of file: | Research Data |
File size: | 3.77 GB |
month_2021-08.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 1336c44885957856e3f660f48604c24f |
---|---|
Type of file: | Research Data |
File size: | 3.79 GB |
month_2021-09.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | cd0b056b420b3de3962687454d713e55 |
---|---|
Type of file: | Research Data |
File size: | 3.56 GB |
month_2021-10.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | c0bb6466b15eb93f9d986666eb2a73ba |
---|---|
Type of file: | Research Data |
File size: | 3.67 GB |
month_2021-11.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | 0fe3dd838f10c1983d837968a9396c3f |
---|---|
Type of file: | Research Data |
File size: | 3.64 GB |
month_2021-12.gz
Zusätzliche Angaben zu der Datei show
Zusätzliche Angaben zu der Datei show
MD5: | efa467e4b0a1292acca8f55986ef0df5 |
---|---|
Type of file: | Research Data |
File size: | 3.49 GB |