I bet there are multiple copies of various depth by various groups including reddit themselves,. But how they will use it, wether they share it we cant really know.
Large chunks are contained within dataset for ai, we may be able to obtain or reverse engineer those.
The internet archive waybackmachine is probably the best most valid source for the public for now.
Who is they and is it publicly accessible?
I bet there are multiple copies of various depth by various groups including reddit themselves,. But how they will use it, wether they share it we cant really know.
Large chunks are contained within dataset for ai, we may be able to obtain or reverse engineer those.
The internet archive waybackmachine is probably the best most valid source for the public for now.
I think /r/datahoarder did
Here is one version:
https://academictorrents.com/details/7c0645c94321311bb05bd879ddee4d0eba08aaee