Questions tagged [data-dump]
For questions about the quarterly Creative Commons data dumps of all public data in the Stack Exchange network Q&A sites.
521 questions
26 votes
1 answer
439 views
Has Stack Exchange Inc. actually declined anyone access to future downloads of SE data dumps?
Over one year ago, Stack Exchange Inc. drastically changed their data dump process and, aside from making it a major pain to download the entire dump of the Stack Exchange network, decided to add the ...
75 votes
6 answers
3k views
Fabricated data in posts.xml for multiple/all data dumps
TL;DR; as I wrote this post parallel to troubleshooting, I went from thinking there was an odd bug to thinking that the company is doing something intentional (or nefarious?) with the data dump again. ...
23 votes
1 answer
355 views
The data dump access page is throwing a 500 server error
Attempting to access the data dump page (this is a "current" user link) throws a server error. This happens across the network, e.g. on Stack Overflow. This has been happening for 2 days now,...
15 votes
0 answers
189 views
Tags starting with dot are missing from `posts.tags` column
Best demonstrated by query which returns no results when run on e.g. AskUbuntu: select * from posts where posttypeid = 1 and tags like '%<.%' This should return, for example, the 1,080 questions ...
28 votes
3 answers
1k views
Posts from deleted users are missing from Data Explorer and Data Dump
I wanted to query for answers where the user account has been deleted, but the answer is still up. According to this link (https://stackoverflow.com/help/deleting-account), deleting your account will ...
10 votes
0 answers
226 views
What is a pre vote?
As seen in New Vote Types in latest data dump?, some new vote types appeared (likely) exclusively in the Stack Overflow data dump. A couple users helped find out what each one meant. However, what ...
15 votes
2 answers
3k views
Data Dumps - updates and bug fixes
Thanks to everyone who posted bug reports and feature requests related to the updated data dumps process. Below, we’ve detailed some work on those reports and requests. Issues reported on this post: ...
21 votes
3 answers
765 views
Latest Data Dump has invalid XML and invalid characters
As I have been looking through the latest StackExchange data dump, it seems like a non-compliant XML serializer was used. There are numerous escape sequences that are simply invalid XML such as &#...