2015-01-12

Keeping Track of the Web: the HTTP Archive, and the Internet Archive

Most people have heard of the Internet Archive, but do you know about the HTTP Archive?

The Internet Archive (a/k/a the "Wayback Machine") keeps track of Web Content over time, while the HTTP Archive keeps track of how digital content is contructed and served on the Web over time.

About the HTTP Archive: "... In 1996 Brewster Kahle realized the cultural significance of the Internet and the need to record its history. As a result he founded the Internet Archive which collects and permanently stores the Web's digitized content. In addition to the content of web pages, it's important to record how this digitized content is constructed and served. The HTTP Archive provides this record. It is a permanent repository of web performance information such as size of pages, failed requests, and technologies utilized. This performance information allows us to see trends in how the Web is built and provides a common data set from which to conduct web performance research... "

Average Page Weight Increases 15% in 2014 - (sitepoint.com): "... The latest figures indicate that average page weight has increased by 15% in one year to reach 1,953Kb — a little under 2Mb — and comprises 95 individual HTTP requests. While this is smaller than the 32% increase in 2013, it remains cause for concern. The report analyzes publicly-accessible content and shopping web sites rather than complex web applications and provides a breakdown of the technologies used: HTML; CSS; JavaScript; Images; Flash; Other...."

The Internet Archive and the HTTP Archive are both useful sources and tools for web developers, designers, creators, publishers, and domain name registrants, or anyone in the domain name industry or involved with internet governance.



Domain Mondo archive