Torrent file
In the BitTorrent file distribution system, a torrent file or meta-info file is a computer file that contains metadata about files and folders to be distributed, and usually also a list of the network locations of trackers, which are computers that help participants in the system find each other and form efficient distribution groups called swarms. A torrent file does not contain the content to be distributed; it only contains information about those files, such as their names, folder structure, and sizes obtained via cryptographic hash values for verifying file integrity. The term torrent may refer either to the metadata file or to the files downloaded, depending on the context.
A torrent file acts like a table of contents that allows computers to find information through the use of a Bittorrent client. A torrent file gives addresses identifying computers that can send parts of the requested file. With the help of a torrent file, one can download small parts of the original file from computers that have already downloaded it. These "peers" allow for downloading of the file in addition to, or in place of, the primary server.
Torrent files are normally named with the extension ".torrent".
Torrent files themselves and the method of using torrent files have been created to ease the load on central servers, as instead of sending a file to for request, it can crowd-source the bandwidth needed for the file transfer, and reduce the time needed to download large files. Many free/freeware programs and operating systems, such as the various Linux distributions and GIMP, offer a torrent download option for users seeking the aforementioned benefits. Other large downloads, such as media files, are often torrented as well.
Background
Typically, Internet access is asymmetrical, supporting greater download speeds than upload speeds, limiting the bandwidth of each download, and sometimes enforcing bandwidth caps and periods where systems are not accessible. This creates inefficiency when many people want to obtain the same set of files from a single source; the source must always be online and must have massive outbound bandwidth. The BitTorrent protocol addresses this by decentralizing the distribution, leveraging the ability of people to network "peer-to-peer", among themselves.Each file to be distributed is divided into small information chunks called pieces. Downloading peers achieve high download speeds by requesting multiple pieces from different computers simultaneously in the swarm. Once obtained, these pieces are usually immediately made available for download by others in the swarm. In this way, the burden on the network is spread among the downloaders, rather than concentrating at a central distribution hub or cluster. As long as all the pieces are available, peers can come and go; no one peer needs to have all the chunks, or to even stay connected to the swarm in order for distribution to continue among the other peers.
A small torrent file is created to represent a file or folder to be shared. The torrent file acts as the key to initiating downloading of the actual content. Someone interested in receiving the shared file or folder first obtains the corresponding torrent file, either by directly downloading it, or by using a magnet link. The user then opens that file in a BitTorrent client, which automates the rest of the process. In order to learn the Internet locations of peers which may be sharing pieces, the client connects to the trackers named in the torrent file, and/or achieves a similar result through the use of distributed hash tables. Then the client connects directly to the peers in order to request pieces and otherwise participate in a swarm. The client may also report progress to trackers, to help the tracker with its peer recommendations.
When the client has all the pieces, the BitTorrent client assemble them into a usable form. They may also continue sharing the pieces, elevating its status to that of seeder rather than ordinary peer.
File structure
A torrent file contains a list of files and integrity metadata about all the pieces, and optionally contains a list of trackers.A torrent file is a bencoded dictionary with the following keys :
- announce—the URL of the tracker
- info—this maps to a dictionary whose keys are dependent on whether one or more files are being shared:
- * files—a list of dictionaries each corresponding to a file. Each dictionary has the following keys:
- ** length—size of the file in bytes.
- ** path—a list of strings corresponding to subdirectory names, the last of which is the actual file name
- * length—size of the file in bytes
- * name—suggested filename where the file is to be saved /suggested directory name where the files are to be saved
- * piece length—number of bytes per piece. This is commonly 28 KiB = 256 KiB = 262,144 B.
- * pieces—a hash list, i.e., a concatenation of each piece's SHA-1 hash. As SHA-1 returns a 160-bit hash, pieces will be a string whose length is a multiple of 20 bytes. If the torrent contains multiple files, the pieces are formed by concatenating the files in the order they appear in the files dictionary.
Extensions
A torrent file can also contain additional metadata defined in extensions to the BitTorrent specification. These are known as "BitTorrent Enhancement Proposals." Examples of such proposals include metadata for stating who created the torrent, and when.Draft extensions
These extensions are under consideration for standardization.Distributed hash tables
BEP-0005 extends BitTorrent to support distributed hash tables.A trackerless torrent dictionary does not have an announce key. Instead, a trackerless torrent has a nodes key:
For example,
'nodes': "127.0.0.1", 6881], "your.router.node", 4804
The specification recommends that nodes "should be set to the K closest nodes in the torrent generating client's [routing table. Alternatively, the key could be set to a known good node such as one operated by the person generating the torrent."
Multiple trackers
BEP-0012 extends BitTorrent to support multiple trackers.A new key, announce-list, is placed in the top-most dictionary
HTTP seeds
BEP-0017 extends BitTorrent to support HTTP seeds.A new key, httpseeds, is placed in the top-most list. This key's value is a list of web addresses where torrent data can be retrieved:
Private torrents
BEP-0027 extends BitTorrent to support private torrents.A new key, private, is placed in the info dictionary. This key's value is 1 if the torrent is private:
Merkle trees
BEP-0030 extends BitTorrent to support Merkle trees. The purpose is to reduce the file size of torrent files, which reduces the burden on those that serve torrent files.A torrent file using Merkle trees does not have a pieces key in the info list. Instead, such a torrent file has a root_hash key in the info list. This key's value is the root hash of the Merkle hash:
Examples
Single file
Here is what a de-bencoded torrent file for a file debian-503-amd64-CD-1.iso might look like:Note: pieces here would be a 51 KiB value.