this post was submitted on 25 Nov 2023
1 points (100.0% liked)

Data Hoarder

24 readers
1 users here now

We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time (tm) ). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.

founded 10 months ago
MODERATORS
 

I was thinking about putting together an offline internet-in-a-box, but nearly all of the tutorials are hosted on a raspberry pi or some other similar, fairly portable low cost device.

I was wondering if anyone has already set up a way to host one of these off a local NAS, with a frontend I could access via browser.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 2 points 10 months ago (2 children)

I don't know for sure, but I think they generally refer to archives of specific websites that would be good to keep in case of an apocalypse. The thing that comes is the Wikipedia archive that is, at minimum, text-only. Not sure which other website archives people like to keep in case the Internet apocalypse.

[–] [email protected] 1 points 10 months ago

they generally refer to archives of specific websites that would be good to keep in case of an apocalypse

How to say archiving Pornhub with saying archiving Pornhub.

[–] [email protected] 1 points 9 months ago (1 children)

Each site you want will different. Wikipedia is ...a wiki site. But, YouTube is a video site. Reddit is a message board. There are blog type sites. There are download sites, Google docs is highly interactive. All just different, a few may be the same-ish. And then, you can't really fit the Internet on a box in your house. And the cost to host it elsewhere will be astronomical. Sounds like you have a few in mind? Find out what software they run, run it locally, and then download the site you can ant to mirror. You should be able to locate a crawler that will build the database(a) locally. Definitely legwork for a whole site/set of sites. Much easier if it's a type of media: books, video, audio, PDFs, a database.

[–] [email protected] 1 points 9 months ago

I'm not sure what OP is trying to create, but most Internet-in-a-box projects I've heard are trying to recreate text-centric sites offline (Reddit for example). I doubt they are trying to create an offline version of video sites, like YouTube. The size requirements alone make it unlikely.