I wanted the advice of reddit here.
I have a search company I've run for the past decade which I'm shutting down in the next 60 days.
We had a good run but got hit by 2-3 major issues at once which makes the business model non-viable in the long term.
We have > 1PB of content. Mostly social media content.
This is all public data. Nothing private or nefarious like private email addresses or passwords.
I don't want to just delete it but I also realize it's insanely expensive to move around.
I also think it would be very irresponsible to just publish it openly. There are plenty of bad actors who would love to do something if they could get this much data for cheap/free.
However, the data is very valuable to researchers. Especially in light of the 2016 election. We have social media data for the last 4 years.
I want to preserve it but want to do so responsibly.
I'm also email@example.com if you'd like to reach out directly.