User:ReyBrujo/Dumps/20070228
From Wikipedia
< User:ReyBrujo | Dumps
	    	    	    
	    
Dumps
| Contents | 
[edit] February 28, 2007
[edit] External link dumps
Articles with more than 5 external links as of February 28, 2007. Only articles in the main space are considered.
| External links | Article ID | Article | 
|---|---|---|
| 100 | 1577 | ආයුෙබා්වන් | 
| 80 | 1686 | විකිපීඩියා | 
| 30 | 2456 | අන්තර්ජාලය | 
| 26 | 2571 | චිකුන්ගුන්යා | 
| 23 | 1684 | Peace | 
| 23 | 1736 | බුදු දහම | 
| 22 | 1732 | Hungary | 
| 17 | 2113 | නිදහස් මෘදුකාංග | 
| 15 | 1685 | Tamil people | 
| 14 | 1516 | ශ්රී ලංකාව | 
| 13 | 1884 | තාරකා විද්යාව | 
| 11 | 2472 | සහන (නිදහස් හා විවෘත ආපදා කළමනාකරණ මෘදුකාංග පද්දතිය) | 
| 10 | 2725 | තස්ලිමා නස්රීන් | 
| 10 | 2112 | ලිනක්ස් | 
| 10 | 1688 | සිංහල ජාතිය | 
| 9 | 1777 | නිර්වාණය | 
| 9 | 1952 | රෝමානු කතෝලික සභාව | 
| 8 | 1687 | Buddha | 
| 8 | 2223 | මීර් අභ්යවකාශ මධ්යස්ථානය | 
| 7 | 1663 | සිංහල භාෂාව | 
| 7 | 2124 | Free/Libre/Open-Source Software | 
| 7 | 2572 | උණ | 
| 6 | 1918 | වැදි ජනයා | 
| 6 | 2229 | සුනිල් සාන්ත | 
| 6 | 2109 | යුනිස්ක්රයිබ් | 
| 5 | 2269 | උබුන්ටු (ලිනක්ස් බෙදා හැරීම) | 
| 5 | 2381 | දුටු ගැමුණු රජතුමා | 
| 5 | 1801 | සිංහල අක්ෂර | 
SELECT COUNT(el_from) AS total, el_from, page_title FROM externallinks, page WHERE externallinks.el_from = page_id AND page_is_redirect = 0 AND page_namespace = 0 GROUP BY el_from ORDER BY total DESC;
[edit] External link ranking
Sites linked more than 5 times as of February 28, 2007. Only articles in the main space are considered.
| Link count | Site | 
|---|---|
| 77 | http://si.wikipedia.org | 
| 18 | http://en.wikipedia.org | 
| 8 | http://www.gnu.org | 
| 5 | http://meta.wikimedia.org | 
SELECT COUNT(el_to) AS total, SUBSTRING_INDEX(el_to, '/', 3) AS search FROM externallinks, page WHERE page_id = el_from AND page_namespace = 0 GROUP BY search ORDER BY total DESC;
[edit] Additional information
Some more information about this dump:
- 225 articles that are in the main space and not redirects
- 267 articles and redirects in the main space
- 782 pages in all namespaces
- 80 redirects in all namespaces
- 2433 external links in every namespace
- 590 external links in the main space
[edit] Very probable spambot pages
If index.php is found in a page title, it is very likely the article talk page has been created by a spambot. These pages should be deleted and protected if possible.
| Article ID | Article | 
|---|---|
| 2666 | W/index.php | 
| 2687 | MediaWiki talk:Ipb cant unblock/w/index.php | 
[edit] Possible spambot pages
Possible pages created by spambots ending with /.
| Article ID | Article | 
|---|---|
| 840 | Wikipedia:Broken/ | 
SELECT page_id, page_title, page_namespace FROM page WHERE page_title LIKE '%index.php%' OR page_title LIKE '%/wiki/%' OR page_title LIKE '%/w/%' OR page_title LIKE '%/';

