Archive.is

archive.is
Archive.is.jpg
Archive.is-Screenshot.png
Screenshot of archive.is
Type of site
Web archiving
Available inMultilingual
Website
Alexa rankPositive decrease 2,897 (July 2018)[1]
CommercialNo
RegistrationNo
Launched2012; 6 years ago (2012)
Current statusOnline

archive.is (formerly archive.today) is an archive site which stores snapshots of web pages.[2] It retrieves one page at a time similar to WebCite, smaller than 50 MB each, but with Web 2.0 sites (such as Google Maps and Twitter) included.

Archive.is uses headless browsing to record what embedded resources need to be captured to provide a high-quality memento, and creates a PNG image to provide a static and non-interactive visualization of the representation.[3]

Unlike crawlers such as Wayback Machine, archive.is only captures individual pages in response to explicit user requests, and so does not obey the robots exclusion standard.[4] Because of this, website owners cannot unilaterally remove snapshots at will, making it a "permanent" archive.[5][not in citation given]

Since July 2013, archive.is supports the Memento Project application programming interface (API).[6][7]

Worldwide availability

China

According to GreatFire.org, archive.is has been blocked in China since March 2016,[8] archive.li since September 2017,[9] and archive.fo since July 2018.[10]

Finland

On July 21, 2015, the operators blocked access to the service from all Finnish IP addresses, stating on Twitter that they did this in order to avoid escalating a dispute they allegedly had with the Finnish government.[11]

Russia

In Russia, only HTTP access is possible; HTTPS connections are blocked.[12][13]

Hosting

The site was protected by Cloudflare from August 16 2018 until September 2 2018.[]

Features

Archive.is records only text and images, excluding video and other non-static content. It keeps track of the history of snapshots saved, returning to the user a request for confirmation before adding a new snapshot of an already saved Internet address.[14]

At November 2018, Archive.is codifies snapshots with an alphanumeric case-sensitive code of five elements. Hence, any element may have one of 42 possible values, between:

  1. ten digits, from 0 to 9.
  2. 32 letters, 16 of the English alphabet, whose number is doubled by 16 capital letter.

The global amount of possible combinations is given by the conditional probability theorem, as follows: milions of identifiers. The number of available URIs can be additionally extended with the use of plus and minus, colon, underscore or slash signs.

Web pages cannot be duplicated from http://archive.is to http://www.archive.org as second-level backup. The reverse - from www.archive.org to archive.is - is possible,[15] but the copy usually takes more time than a direct capture. Some web sites can't be saved by either Internet Archive or archive.is due to their robots.txt file.

The research toolbar enables advanced keywords operators, using * as the wildcard character. A couple of quotation marks address the search to an exact sequence of keywords present in the title or in the body of the webpage, whereas the insite operator restricts it to a specific Internet domain[16].

Once a web page is archived, it cannot be deleted directly by any Internet[17].

See also

References

  1. ^ "Archive.is Site Info". Site Info. Alexa Internet. Retrieved 2015.
  2. ^ Martin Brinkmann (22 April 2015). "Create publicly available web page archives with Archive.is". Ghacks. Retrieved 2015.
  3. ^ Brunelle, Justin F.; Kelly, Mat; Weigle, Michele C.; Nelson, Michael L. (25 January 2015). "The impact of JavaScript on archivability". International Journal on Digital Libraries. 17 (2): 95-117. doi:10.1007/s00799-015-0140-8.
  4. ^ Dascalescu, Dan (18 February 2013). "Web page archiving - Dan Dascalescu's Wiki (review)". Wiki.dandascalescu.com. Retrieved 2013.
  5. ^ Koebler, Jason (29 October 2014). "Dear GamerGate: Please Stop Stealing Our Shit". Motherboard.
  6. ^ Nelson, Michael L. (9 July 2013). "Archive.is Supports Memento". Research and Teaching Updates. Web Science and Digital Libraries Research Group at Old Dominion University. Archived from the original on 27 July 2013. Retrieved 2013.
  7. ^ "archive.is". Memento Protocol Information. Memento Development Group. Archived from the original on 15 September 2013. Retrieved 2013.
  8. ^ "archive.is is 100% blocked in China". GreatFire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
  9. ^ "archive.li is 100% blocked in China". Great Fire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
  10. ^ "archive.fo is 100% blocked in China". Great Fire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
  11. ^ Lapintie, Lassi (22 July 2015). "Suomalaisilta estettiin haktivistien suosimalla verkkosivulla käynti" [Finns' access to website used by hacktivists blocked]. Iltalehti (in Finnish). Retrieved 2016.
  12. ^ Elistratov, Vladimir (29 January 2016). " archive..., -". TJournal (in Russian). Retrieved 2016.
  13. ^ Cushing, Tim (4 February 2016). "Russia Blocks Another Archive Site Because It Might Contain Old Pages About Drugs". Techdirt. Retrieved 2016.
  14. ^ "Example snapshot history on archive.is".
  15. ^ "Example: Page saved from Web Archive to Archive.is". Archived from the original on 2013-05-17.
  16. ^ For example, the string insite: https://en.wikipedia.org "World Cup" returns the "World+Cup"/ related snapshots
  17. ^ com/amp/s/blog.archive.is/post/41395737942/how-can-i-delete-an-archived-page/amp "Some Frequently Asked Question" Check |url= value (help). archive.is blog. Archived from the original on Sep 26, 2013. Retrieved 2018.

External links


  This article uses material from the Wikipedia page available here. It is released under the Creative Commons Attribution-Share-Alike License 3.0.

Archive.is
 



 

Connect with defaultLogic
What We've Done
Led Digital Marketing Efforts of Top 500 e-Retailers.
Worked with Top Brands at Leading Agencies.
Successfully Managed Over $50 million in Digital Ad Spend.
Developed Strategies and Processes that Enabled Brands to Grow During an Economic Downturn.
Taught Advanced Internet Marketing Strategies at the graduate level.


Manage research, learning and skills at defaultlogic.com. Create an account using LinkedIn to manage and organize your omni-channel knowledge. defaultlogic.com is like a shopping cart for information -- helping you to save, discuss and share.


  Contact Us