|Launched||March 14, 2006|
Amazon S3 (Simple Storage Service) is a web service offered by Amazon Web Services. Amazon S3 provides storage through web services interfaces (REST, SOAP, and BitTorrent). Amazon launched S3, its fifth publicly available web service, in the United States in March 2006 and in Europe in November 2007.
Amazon says that S3 uses the same scalable storage infrastructure that Amazon.com uses to run its own global e-commerce network.
Amazon S3 is reported to store more than 2 trillion objects as of April 2013. This is up from 10 billion as of October 2007, 14 billion in January 2008, 29 billion in October 2008, 52 billion in March 2009, 64 billion objects in August 2009, and 102 billion objects in March 2010. S3 uses include web hosting, image hosting, and storage for backup systems. S3's service-level agreement (SLA) guarantees 99.9% monthly uptime, that is, not more than about 43 minutes of downtime per month.
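The 43-minute figure follows directly from the uptime percentage. The short sketch below reproduces the arithmetic; the function name and the assumption of a 30-day month are illustrative choices, not part of the SLA text.

```python
def allowed_downtime_minutes(uptime_percent: float, days_in_month: int = 30) -> float:
    """Return the downtime budget, in minutes, for a monthly uptime target.

    Assumes a 30-day month by default (an illustrative simplification).
    """
    minutes_in_month = days_in_month * 24 * 60
    return minutes_in_month * (1 - uptime_percent / 100)


if __name__ == "__main__":
    budget = allowed_downtime_minutes(99.9)
    print(f"{budget:.1f} minutes")  # 43.2 minutes
```

With a 31-day month the same calculation gives a slightly larger budget, which is why the figure is best read as approximate.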
Amazon does not make details of S3's design public, though it clearly manages data with an object storage architecture. According to Amazon, S3's design aims to provide scalability, high availability, and low latency at commodity costs.
S3 stores arbitrary objects (computer files) up to 5 terabytes in size, each accompanied by up to 2 kilobytes of metadata. Objects are organized into buckets (each owned by an Amazon Web Services account), and identified within each bucket by a unique, user-assigned key. Amazon Machine Images (AMIs), which are used in the Elastic Compute Cloud (EC2), can be exported to S3 as bundles.
Buckets and objects can be created, listed, and retrieved using either a REST-style HTTP interface or a SOAP interface. Additionally, objects can be downloaded using the HTTP GET interface and the BitTorrent protocol.
Requests are authorized using an access control list associated with each bucket and object.
Bucket names and keys are chosen so that objects are addressable using HTTP URLs:
http://bucket/key (where bucket is a DNS CNAME record pointing to bucket.s3.amazonaws.com)
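As a rough illustration of this addressing scheme, the sketch below builds an object URL from a bucket name and key. The function name, the `cname` flag, and the example bucket names are hypothetical; only the `bucket.s3.amazonaws.com` host pattern comes from the text above.

```python
from urllib.parse import quote

S3_ENDPOINT = "s3.amazonaws.com"  # endpoint named in the text above


def object_url(bucket: str, key: str, cname: bool = False) -> str:
    """Build an HTTP URL for an object.

    If cname is True, the bucket name is assumed to also be a DNS name
    with a CNAME record pointing at <bucket>.s3.amazonaws.com, so it is
    used directly as the host.
    """
    host = bucket if cname else f"{bucket}.{S3_ENDPOINT}"
    # quote() percent-encodes unsafe characters but keeps "/" in the key.
    return f"http://{host}/{quote(key)}"


if __name__ == "__main__":
    print(object_url("my-bucket", "index.html"))
    # http://my-bucket.s3.amazonaws.com/index.html
    print(object_url("media.example.com", "photos/cat 1.jpg", cname=True))
    # http://media.example.com/photos/cat%201.jpg
```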
Because objects are accessible by unmodified HTTP clients, S3 can be used to replace significant existing (static) web hosting infrastructure. The AWS authentication mechanism allows the bucket owner to create an authenticated URL with time-bounded validity. That is, the owner can construct a URL that can be handed off to a third party for access for a limited period, such as the next 30 minutes or the next 24 hours.
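A minimal sketch of such a time-bounded URL follows, assuming the legacy Signature Version 2 query-string scheme (an HMAC-SHA1 signature over the verb, expiry time, and resource path). Newer deployments use Signature Version 4, and real applications would normally rely on an AWS SDK rather than hand-rolled signing. The function name and the credentials shown are placeholders.

```python
import base64
import hashlib
import hmac
import time
from urllib.parse import quote, urlencode


def signed_get_url(bucket: str, key: str, access_key_id: str,
                   secret_key: str, expires_at: int) -> str:
    """Return a GET URL valid until expires_at (seconds since the epoch).

    Sketch of legacy Signature Version 2 query-string authentication.
    """
    resource = f"/{bucket}/{quote(key)}"
    # Verb, Content-MD5 (empty), Content-Type (empty), expiry, resource.
    string_to_sign = f"GET\n\n\n{expires_at}\n{resource}"
    digest = hmac.new(secret_key.encode(), string_to_sign.encode(),
                      hashlib.sha1).digest()
    signature = base64.b64encode(digest).decode()
    query = urlencode({
        "AWSAccessKeyId": access_key_id,
        "Expires": expires_at,
        "Signature": signature,
    })
    return f"http://{bucket}.s3.amazonaws.com/{quote(key)}?{query}"


if __name__ == "__main__":
    thirty_minutes_from_now = int(time.time()) + 30 * 60
    # Placeholder credentials, for illustration only.
    print(signed_get_url("my-bucket", "report.pdf",
                         "AKIDEXAMPLE", "not-a-real-secret",
                         thirty_minutes_from_now))
```

Because the expiry time is part of the signed string, a recipient cannot extend the validity period by editing the `Expires` parameter without invalidating the signature.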
Every item in a bucket can also be served up as a BitTorrent feed. The S3 store can act as a seed host for a torrent and any BitTorrent client can retrieve the file. This drastically reduces the bandwidth costs for the download of popular objects. While the use of BitTorrent does reduce bandwidth, AWS does not provide native bandwidth limiting and as such users have no access to automated cost control. This can lead to users on the "free-tier" S3 or small hobby users amassing dramatic bills. AWS representatives have previously stated that such a feature was on the design table from 2006 to 2010 but have recently stated the feature is no longer in development.
At its inception, Amazon charged end users US$0.15 per gigabyte-month, with additional charges for bandwidth used in sending and receiving data, and a per-request (get or put) charge. On November 1, 2008, pricing moved to tiers where end users storing more than 50 terabytes receive discounted pricing.
Amazon S3 provides options to host static websites with index document and error document support. This support was added as a result of user requests dating at least to 2006. For example, suppose that Amazon S3 was configured with CNAME records to host http://subdomain.example.com/. In the past, a visitor to this URL would find only an XML-formatted list of objects instead of a general landing page (e.g., index.html) to accommodate casual visitors. Now, however, websites hosted on S3 may designate a default page to display, and another page to display in the event of a partially invalid URL.
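The effect of the index and error documents can be pictured with a toy model of request routing. This is a simplified sketch, not the service's actual logic: the function name, the set-of-keys representation of a bucket, and the default document names are all assumptions made for illustration.

```python
def resolve_website_key(path: str, keys: set,
                        index_document: str = "index.html",
                        error_document: str = "error.html") -> tuple:
    """Map a request path to (HTTP status, object key) for a static site.

    Toy model: paths ending in "/" get the index document appended, and
    any key missing from the bucket falls back to the error document.
    """
    key = path.lstrip("/")
    if key == "" or key.endswith("/"):
        key += index_document
    if key in keys:
        return 200, key
    return 404, error_document


if __name__ == "__main__":
    bucket_keys = {"index.html", "error.html", "docs/index.html", "logo.png"}
    print(resolve_website_key("/", bucket_keys))         # (200, 'index.html')
    print(resolve_website_key("/docs/", bucket_keys))    # (200, 'docs/index.html')
    print(resolve_website_key("/missing", bucket_keys))  # (404, 'error.html')
```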
Photo hosting service SmugMug has used S3 since April 2006. They experienced a number of initial outages and slowdowns, but after one year they described it as being "considerably more reliable than our own internal storage" and claimed to have saved almost $1 million in storage costs.
There are various Filesystem in Userspace (FUSE)-based file systems for Unix-like operating systems (Linux, etc.) that can be used to mount an S3 bucket as a file system. Note that because the semantics of S3 are not those of a POSIX file system, a bucket mounted this way may not behave entirely as expected.
Apache Hadoop file systems can be hosted on S3, as its requirements of a file system are partially met by S3. As a result, Hadoop can be used to run MapReduce algorithms on EC2 servers, reading data and writing results back to S3.
Netflix uses Amazon Web Services for its storage and compute operations, with S3 being its system of record. To address the consistency limitations of S3, Netflix implemented a tool, S3mper, which stores the filesystem metadata (filenames, directory structure, and permissions) in Amazon DynamoDB.
S3 was used in the past by some enterprises as a long term archiving solution, until Amazon Glacier was released.
The API has become a popular method for object storage. As a result, more and more applications have been built to natively support the S3 API. This includes applications that write data to AWS S3, as well as to S3-compatible object stores:
|Category||Vendor||Product|
|Client Backup||Haystack Software LLC||Arq backup|
|Client Backup||CloudBerry Lab||CloudBerry Backup|
|MySQL Backup||Oracle||MySQL Enterprise Backup|
|Oracle Database Backup||Oracle||Oracle Secure Backup Cloud Manager|
|Server Backup||Asigra||Asigra Cloud Backup|
|Cloud Storage Gateway||CTERA Networks||C00 Series|
|Cloud Storage Gateway||Avere||FXT Series|
|Cloud Storage Gateway||EMC||CloudArray|
|Cloud Storage Gateway||Microsoft||StorSimple|
|Cloud Storage Gateway||Nasuni||NF Series|
|Cloud Storage Gateway||NetApp||Altavault|
|Cloud Storage Gateway||Panzura||Global File System|
|Sync & Share||Storage Made Easy||SME|
|Hybrid Storage||Cloudian||Cloudian HyperStore|
|Hybrid Storage||NooBaa||NooBaa Storage|
Amazon S3 allows users to enable or disable logging. If enabled, the logs are stored in Amazon S3 buckets, where they can then be analyzed. These logs contain useful information about the requests made to a bucket.
These logs can be analyzed and managed by using third-party tools such as S3Stat, Cloudlytics, Qloudstat, AWS Stats or Splunk.
The broad adoption of Amazon S3 and related tooling has given rise to competing services based on the S3 API. These services use the standard programming interface; however, they are differentiated by their underlying technologies and supporting business models. A cloud storage standard (like electrical and networking standards) enables competing service providers to design their services and clients using different parts in different ways yet still communicate.
Examples of competing S3 compliant storage implementations include: