Wednesday, September 30, 2015

Wayback Machine APIs | Internet Archive

Wayback Machine APIs | Internet Archive

Wayback Machine APIs

The Internet Archive Wayback Machine supports a number of different APIs to make it easier for developers to retrieve information about Wayback capture data.

The following is a listing of currently supported APIs. This page is subject to change frequently, please check back for the latest info.

Updated on September, 24, 2013

Wayback Availability JSON API

This simple API for Wayback is a test to see if a given url is archived and currenlty accessible in the Wayback Machine. This API is useful for providing a 404 or other error handler which checks Wayback to see if it has an archived copy ready to display. The API can be used as follows:

http://archive.org/wayback/available?url=example.com

which might return:

{      "archived_snapshots": {          "closest": {              "available": true,              "url": "http://web.archive.org/web/20130919044612/http://example.com/",              "timestamp": "20130919044612",              "status": "200"          }      }  }  

if the url is available. When available, the url is the link to the archived snapshot in the Wayback Machine At this time, archived_snapshots just returns a single closest snapshot, but additional snapshots may be added in the future.

If the url is not available (not archived or currently not accessible), the response will be:

{"archived_snapshots":{}}  

Other Options

Additional options which may be specified are timestamp and callback

Memento API

The Internet Archive Wayback Machine is also fully compliant with the Memento Protocol The Memento API provides additional interfaces for querying snapshots (eg 'Mementos') in the Wayback Machine. The Availability API is partially based on the Memento APIs.

Here are some specific examples of Memento support in the Wayback Machine

Wayback CDX Server API

The CDX Server is another API which allows for complex querying, filtering and analysis of Wayback capture data. If you are looking for more in depth information about Wayback machine data, please take a look at the CDX server API.

The latest documentation on the CDX server can be found at: Wayback CDX Server @ GitHub



^ed

No comments:

Post a Comment