File sharing

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 82.174.179.110 (talk) at 10:50, 2 July 2007 (→‎The first P2P-generation: Server-client). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

File sharing is the practice of making files available for other users to download over the Internet and smaller networks. Usually file sharing follows the peer-to-peer (P2P) model, where the files are stored on and served by personal computers of the users. Most people who engage in file sharing are also downloading files that other users share. Sometimes these two activities are linked together. P2P file sharing is distinct from file trading in that downloading files from a P2P network does not require uploading, although some networks either provide incentives for uploading such as credits or force the sharing of files being currently downloaded.

The first P2P-generation: Server-client

The first generation of peer-to-peer file sharing networks had a centralized file list. Ultimately, Napster was held liable even if it used the most advanced technology available to identify works copyright holders had asked it to block, because no technology that can identify works with 100% certainty exists. Napster still exists today, but as a subsidiary of Roxio after they bought the name during the original Napster's bankruptcy phase, continues to operate today, and is now legally distributing music under a subscription-based model.

In the centralized peer-to-peer model, a user would send a search to the centralized server of what they were looking for, that is, a song, video, or movie. The server then sends back a list of which peers have the data and facilitates the connection and download.

The first file sharing programs marked themselves by inquiries to a server, either the data to the Download held ready or in appropriate different Peers and so-called Nodes further-obtained, so that one could download there. Best example was Napster (today a Payment offerer) or eDonkey2000 in the server version (today likewise with Overnet and KAD - network decentralized).

Web based sharing

You must add a |reason= parameter to this Cleanup template – replace it with {{Cleanup|section|reason=<Fill reason here>}}, or remove the Cleanup template.
Webhosting for file sharing is also used. Since it is possible to exchange privately and in small communities popular files can be distributed very quickly and efficiently. Web hosters are independent from each other, therefore contents are not distributed further. Other terms for this are one-click hosting and web based sharing.

Server-client-protocols

  • Audiogalaxy - Service ended in the middle of 2002.
  • Direct Connect
  • Napster - Closed in its original form in July 2001, since changed to a fee-based service.
  • Scour Exchange - The second exchange network after Napster. No longer exists.
  • Soulseek - Still popular today despite being relatively old, with more than 120,000 users online at any time.
  • TinyP2P - 15 lines Python - SOURCE code
  • WinMX - The original servers were switched off in September 2005, but servers from other companies have taken over. Version 3.54 beta 4 was available from http://www.winmxgroup.com/ however was shut down June 29,2007. How this will affect the future of WinMX is unknown at this time.

The second P2P-Generation: Decentralization

After Napster encountered legal troubles, Justin Frankel of Nullsoft set out to create a network without a central index server, and Gnutella was the result. Unfortunately, the Gnutella model of all nodes being equal quickly died from bottlenecks as the network grew from incoming Napster refugees. FastTrack solved this problem by having some nodes be 'more equal than others'.

By electing some higher capacity nodes to be indexing nodes, with lower capacity nodes branching off from them, FastTrack allowed for a network that could scale to a much larger size. Gnutella quickly adopted this model, and most current peer-to-peer networks implement this design, as it allows for large and efficient networks without central servers.

Also included in the second generation are distributed hash tables (DHTs), which help solve the scalability problem by electing various nodes to index certain hashes (which are used to identify files), allowing for fast and efficient searching for any instances of a file on the network. This is not without drawbacks; perhaps most significantly, DHTs do not directly support keyword searching (as opposed to exact-match searching).

The best examples are Gnutella, Kazaa or eMule with Kademlia, whereby Kazaa has still a central server for logging in. eDonkey2000/Overnet, Gnutella, FastTrack and Ares Galaxy have summed up approx. 10.3 million users (as of April 2006, according to slyck.com). This number not necessarily corresponds to the actual number of persons who use these networks; it must be assumed that some use multiple clients for different networks. The number of BitTorrent users cannot be measured directly, however, there should be more users than of eDonkey2000 and that's why providers try to restrict the BitTorrent traffic first. One must also understand that number of users indicated by software applies only to users which are active at this moment.

Multi-Network-Clients

eDonkey2000 - Network

Gnutella - Network

BitTorrent - Network

See also: Comparison of BitTorrent software

FastTrack - Network

Further networks or clients

The third P2P-Generation: indirect and encrypted

The third generation of peer-to-peer networks are those that have anonymity features built in. Examples of anonymous networks are ANts P2P, RShare, Freenet, I2P, GNUnet and Entropy.

A degree of anonymity is realized by routing traffic through other users clients, which have the function of network nodes.This makes it harder for someone to identify who is downloading or who is offering files. Most of these programs also have strong encryption to resist traffic sniffing.

Friend-to-friend networks only allow already known users (also known as "friends") to connect to your computer, then each node can forward requests and files anonymously between its own "friends" nodes.

Third generation networks have not reached mass usage for file sharing because most current implementations incur too much overhead in their anonymity features, making them slow or hard to use.

An example might be: Petra gives a file to Oliver, then Oliver gives the file to Anna. Petra and Anna thus never become acquainted and thus are protected. Virtual IP addresses are also often used, further obfuscating the user's network location. Additionally all transfers are encrypted, so that even the network administrators cannot see what was sent to whom. Example software includes WASTE, JetiANts, Tor and I2P. These clients differ greatly in their goals and implementation. WASTE is designed only for small groups and may therefore be considered Darknet, ANts and I2P are public Peer to Peer systems, with anonymization provided exclusively by routing reach.

Waste network

  • WASTE
  • kDrive similar to WASTE, development stopped at present

Ants network

Mute network

I2P network

Retroshare-Network (F2F Instant Messenger)

other networks or clients

The fourth P2P-Generation: Streams over P2P

Apart from the traditional file sharing there are services that send streams instead of files over a P2P network. Thus one can hear radio and watch television without any server involved -- the streaming media is distributed over a P2P network. It is important that instead of a treelike network structure, a swarming technology known from BitTorrent is used. Best examples are Peercast, Cybersky and demo TV.

General

Tree structure

Swarm structure such as Bittorrent

Legal and copyright issues

File sharing grew in popularity with the proliferation of high speed Internet connections, relatively small file size and high-quality MP3 audio format. Although file sharing is a legal technology with legal uses, many users use it to download and upload copyrighted materials without permission, which can be copyright infringement if done without authorization for improper purposes. Despite the existance of various international treaties, there are still sufficient variations between countries to cause significant difficulties in the protection of intellectual property. This has led to attacks against file sharing in general from many copyright owners and litigation by industry bodies against private individual sharers.

The challenges facing copyright holders in the face of file sharing systems are quite novel historically and have highlighted many new challenges in both theory and practice:

  • Ambiguities in the interpretation of copyright law
  • The new challenges posed by international communications and varying legislations
  • Mass litigation and the development of processes for evidence and discovery
  • Rapidly developing new technologies and uses
  • Low barriers to entry by would-be sharers and the development of a mass usage of the technologies
  • File sharing approaches developed in response to litigation against sharers, which obfusticate or hide the fact that sharing is happening, or the identities of those involved. For example: encryption and darknets.

Further challenges have arisen because of the need to balance self-protection against fair use. A perceived overbalance towards protection (in the form of media that cannot be backed-up, cannot be played on multiple systems by the owner, or contains rootkits[1] or irksome security systems inserted by manufacturers) has led to a backlash against protection systems in some quarters. For example, the first crack of AACS was inspired by a perceived unfair restriction on owner usage.[2]

Attacks on peer-to-peer networks

Many peer-to-peer networks are under constant attack by people with a variety of motives.

Examples include:

  • poisoning attacks (e.g. providing files whose contents are different from the description, aka "spoofing")
  • polluting attacks (e.g. inserting "bad" chunks/packets into an otherwise valid file on the network)
  • defection attacks (users or software that make use of the network without contributing resources to it)
  • insertion of viruses to carried data (e.g. downloaded or carried files may be infected with viruses or other malware)
  • malware in the peer-to-peer network software itself (e.g. distributed software may contain spyware)
  • denial of service attacks (attacks that may make the network run very slowly or break completely)
  • filtering (network operators may attempt to prevent peer-to-peer network data from being carried)
  • identity attacks (e.g. tracking down the users of the network and harassing or legally attacking them)
  • spamming (e.g. sending unsolicited information across the network- not necessarily as a denial of service attack)

Risks

Some file sharing software comes bundled with malware such as spyware or adware. Sometimes this malware remains installed on the system even if the original file sharing software is removed, and can be very difficult to eliminate. In many cases such malware can interfere with the correct operation of web browsers, anti-virus software, anti-spyware and software firewalls, and can cause degraded performance on affected systems. Such malware is typically bundled with proprietary software, and not those in open source. In most cases it is possible to easily remove adware and spyware by running spyware removal programs. Such programs can often remove malware without influencing the functionality of the file sharing software.

Some are also concerned about the use of file sharing systems to distribute child pornography, inflammatory literature, and illegal or unpopular material. Novice users may find it difficult to obtain information on which networks are "safe" for them to use. However, experienced users know that there is only one way to get in contact with such material: You have to actively search for it.[3] Therefore they recommend not to search for illegal material.

See also

References

  1. ^ See 2005 Sony BMG CD copy protection scandal.
  2. ^ By "Muslix64", written on doom9's forum. See original post and the ensuing AACS encryption key controversy.
  3. ^ Morris, Alan (2003-08-22). "Testimony of Mr. Alan Morris about Pornography, Technology and Process: Problems and Solutions on Peer-to-Peer Networks". United States Senate, Committee on the Judiciary, Washington, DC. Retrieved 2006-11-20.

Further reading

External links