 |
|
 |
|
There has been little research conducted into the current state of SMTP (Simple Mail Transfer Protocol) servers on the Internet, perhaps because this area of research has not been particularly fashionable in comparison to the HTTP metrics which are commonly collected. SMTP is the protocol used to deliver email on the Internet. This is an important area of research however given the level of traffic served by these systems has been growing for years. Barracuda Networks cite Radicati research which indicates that in 2009 228 billion emails will be sent per day, with the vast majority being spam (see Barracuda's site for more details). Afergan and Beverly in "The state of the email address" evaluate the state of email servers in an attempt to determine how SMTP servers are coping with the growth in traffic. Their approach involved sending out probe emails to a variety of domains. The email was crafted to have a strong assurance of bouncing because of not being addressed to a valid address. The authors then monitored the bounce traffic. They concluded that corporate SMTP servers are under surprising levels of strain and do not bounce undeliverable emails in a predictable manner.
This site hosts data derived from my survey of SMTP servers commented to the Internet. The survey takes the form of a simple popularity contest, with around 46 million IP address currently surveyed. The results are based on parsing the "220 status" line that SMTP servers send a client upon connection to TCP port 25. Current results are available, as well as an interactive lookup interface if you're curious about a specific domain name. If you're interested in the methodology for the survey, you should checkout my LISA 2007 poster about this survey which is the most recent public documentation. Its a little out of date at the moment, but the latest version isn't yet ready for publication.
There have been surveys of SMTP servers conducted by others. The table below provides a summary:
| Date |
Surveyor |
Sample size |
Sample method |
Responses |
URL |
| 27 November 1996 |
DJB |
500,000 |
Selective random |
25,121 |
Site |
| 14 August 1997 |
DJB |
200,000 |
Selective random |
8,056 |
Site |
| 11 May 1998 |
DJB |
20,310 |
MX walk |
17,592 |
Site |
| 2 April 2000 |
DJB |
12,595 |
Selective random |
10,087 |
Site |
| 5 October 2000 |
DJB |
25,777 |
Random |
859 |
Site |
| 27 September 2001 |
DJB |
39,206 |
Random |
937 |
Site |
| 1 December 2002 |
Credentia |
4,096 |
Random |
1,837 |
Site |
| 1 January 2003 |
Credentia |
30,000 |
Random |
17,540 |
Site |
| 1 April 2003 |
Credentia |
37,563 |
Random |
20,410 |
Site |
| 1 May 2007 |
MailChannels |
400,000 |
Listing of domain names owned by companies |
254,400 |
O'Reilly article |
| Ongoing |
E-Soft |
Unknown |
Unknown |
Unknown |
Site (data available for a fee) |
| Ongoing |
Michael Still / Eric McCreath |
46,891,390 |
Exhaustive |
1,923,664 |
Site |
|
|
|
|
|