Saving/downloading the internet (theory and practice)?
Hey, I realize that you can NEVER save the entire internet. Simply because it changes from moment to moment, and much of the data isn't freely accessible (e.g., password-protected areas).
However, I'm still wondering whether it's possible to save and display the "freely accessible" internet at a size of just a few GB. I'm asking because it's already possible to install a model such as GPT4All that holds a lot of knowledge but is only a few GB in size.
I think we should limit the theory to just websites for now, so it stays easier to understand and avoids a lot of problems for the time being. At least, that's how I see it.
Thank you in advance, and please bear with my dyslexia.
Birds – Wikipedia: a simple website, from Wikipedia. Let's take the HTML code. Only the HTML code; CSS and JS stay out of it.
According to the UTF-8 string length & byte counter (mothereff.in), we're at 202,490 characters and 205,803 bytes. And that's just one page. But Wikipedia has millions of them. If we add other hosts, the numbers become impossible. It simply gets too big, and that's only the HTML part.
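For anyone who wants to reproduce those numbers: a minimal sketch in Python (assuming a plain HTTP fetch of an example article URL) that downloads only the HTML and counts characters and bytes, roughly what the mothereff.in counter does.

```python
# Rough sketch: fetch one Wikipedia article's raw HTML and measure its size.
# The URL is just an example; any page works the same way.
from urllib.request import urlopen

url = "https://en.wikipedia.org/wiki/Bird"
with urlopen(url) as response:
    raw = response.read()           # bytes as delivered (HTML only, no separate CSS/JS files)

html = raw.decode("utf-8")          # decode to count characters instead of bytes
print(f"{len(html):,} characters")
print(f"{len(raw):,} bytes")        # larger, because some characters need several bytes in UTF-8
```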
No, it's not even possible, if only for lack of space. It would take forever to do that. AIs like ChatGPT only use APIs to existing search engines. The offline versions know a lot, but that is not even 1% of the data on the internet.
Okay, so we don't even get to 1%. Would it be possible with that 1%?
negative.
Every page on the internet is already stored on its respective server. However, a private user alone doesn't have the means to store everything again redundantly.
And what if that's all I want as output? I'm only talking about retrieving static pages. I don't need any functionality or anything, just the possibility to view all the pages, as I'd put it, without the backend technology.
(I hope that's understandable.)
Even then, no. That's inconceivable. YouTube, Wikipedia, the Web Archive, etc. are immensely large.
accepted
No. The information remains the same, and you can’t compress it any more.
That's true, but wouldn't it still be possible to compress it with enough server resources, like with the AI models?
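As a middle ground between the two views: lossless compression (the kind a server could do) does buy something, but only a constant factor. A quick sketch with Python's built-in zlib, using a hypothetical saved page as input, shows the order of magnitude; HTML typically shrinks by a factor of about three to five, which is nowhere near turning petabytes into a few GB the way a lossy AI model does.

```python
# Sketch: how much does lossless compression save on one HTML page?
# "page.html" is a placeholder for any page you have saved locally.
import zlib

with open("page.html", "rb") as f:
    raw = f.read()

packed = zlib.compress(raw, level=9)    # highest (slowest) compression level
ratio = len(raw) / len(packed)

print(f"original:   {len(raw):,} bytes")
print(f"compressed: {len(packed):,} bytes")
print(f"ratio:      {ratio:.1f}x")      # usually around 3-5x for HTML, never 1000x
```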
70% of the internet is the deep web, i.e. server data that isn't accessible in the first place.
There are already websites for that. It's called the Wayback Machine.
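Side note: the Wayback Machine has a public availability endpoint, so you can at least check whether a page is already archived instead of saving it yourself. A small sketch (the archive.org availability API, queried for an example URL):

```python
# Sketch: ask the Wayback Machine whether a URL already has an archived snapshot.
# Uses the public availability API described at https://archive.org/help/wayback_api.php
import json
from urllib.parse import quote
from urllib.request import urlopen

target = "https://en.wikipedia.org/wiki/Bird"    # example URL
api = "https://archive.org/wayback/available?url=" + quote(target, safe="")

with urlopen(api) as response:
    data = json.load(response)

closest = data.get("archived_snapshots", {}).get("closest")
if closest:
    print("archived copy:", closest["url"], "from", closest["timestamp"])
else:
    print("no snapshot found")
```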
And is it possible to compress just those down to a few GB? Like the AI models? (I don't know much about this yet.)
You cannot compress the sites down to smaller sizes; if something is 6 GB, it is 6 GB. You can index them, but then you have Google.
And AIs were fed with terabytes, if not petabytes, of data.
Google itself doesn't show me the site, it only links to it. Maybe I phrased that a bit clumsily.
Then you're just calling up Google.
Would it be possible to train a kind of AI that isn't meant for chatting, but serves as a search function and then shows the pages?
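What that describes is closer to an index over stored copies than to a chat model. A toy sketch in pure Python, with a couple of made-up saved pages, of an inverted index: every word points to the pages containing it, and the "search function" returns the stored page itself rather than a link.

```python
# Toy sketch: an inverted index over locally saved pages.
# The pages dict stands in for whatever HTML/text you have actually downloaded.
from collections import defaultdict

pages = {
    "birds.html": "Birds are a group of warm-blooded vertebrates with feathers.",
    "bats.html":  "Bats are the only mammals capable of true and sustained flight.",
}

# Build the index: every word maps to the set of pages that contain it.
index = defaultdict(set)
for name, text in pages.items():
    for word in text.lower().split():
        index[word.strip(".,")].add(name)

def search(word: str) -> list[str]:
    """Return the stored pages themselves (not just links) that contain the word."""
    return [pages[name] for name in index.get(word.lower(), set())]

print(search("feathers"))   # -> the saved birds.html text, shown directly
```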