User:ReyBrujo/Dumps/20070228
From Wikipedia
< User:ReyBrujo | Dumps
Dumps
Contents |
[edit] February 28, 2007
[edit] External link dumps
Articles with more than 5 external links as of February 28, 2007. Only articles in the main space are considered.
| External links |
Article ID | Article |
|---|---|---|
| 100 | 1577 | ආයුෙබා්වන් |
| 80 | 1686 | විකිපීඩියා |
| 30 | 2456 | අන්තර්ජාලය |
| 26 | 2571 | චිකුන්ගුන්යා |
| 23 | 1684 | Peace |
| 23 | 1736 | බුදු දහම |
| 22 | 1732 | Hungary |
| 17 | 2113 | නිදහස් මෘදුකාංග |
| 15 | 1685 | Tamil people |
| 14 | 1516 | ශ්රී ලංකාව |
| 13 | 1884 | තාරකා විද්යාව |
| 11 | 2472 | සහන (නිදහස් හා විවෘත ආපදා කළමනාකරණ මෘදුකාංග පද්දතිය) |
| 10 | 2725 | තස්ලිමා නස්රීන් |
| 10 | 2112 | ලිනක්ස් |
| 10 | 1688 | සිංහල ජාතිය |
| 9 | 1777 | නිර්වාණය |
| 9 | 1952 | රෝමානු කතෝලික සභාව |
| 8 | 1687 | Buddha |
| 8 | 2223 | මීර් අභ්යවකාශ මධ්යස්ථානය |
| 7 | 1663 | සිංහල භාෂාව |
| 7 | 2124 | Free/Libre/Open-Source Software |
| 7 | 2572 | උණ |
| 6 | 1918 | වැදි ජනයා |
| 6 | 2229 | සුනිල් සාන්ත |
| 6 | 2109 | යුනිස්ක්රයිබ් |
| 5 | 2269 | උබුන්ටු (ලිනක්ස් බෙදා හැරීම) |
| 5 | 2381 | දුටු ගැමුණු රජතුමා |
| 5 | 1801 | සිංහල අක්ෂර |
SELECT COUNT(el_from) AS total, el_from, page_title FROM externallinks, page WHERE externallinks.el_from = page_id AND page_is_redirect = 0 AND page_namespace = 0 GROUP BY el_from ORDER BY total DESC;
[edit] External link ranking
Sites linked more than 5 times as of February 28, 2007. Only articles in the main space are considered.
| Link count | Site |
|---|---|
| 77 | http://si.wikipedia.org |
| 18 | http://en.wikipedia.org |
| 8 | http://www.gnu.org |
| 5 | http://meta.wikimedia.org |
SELECT COUNT(el_to) AS total, SUBSTRING_INDEX(el_to, '/', 3) AS search FROM externallinks, page WHERE page_id = el_from AND page_namespace = 0 GROUP BY search ORDER BY total DESC;
[edit] Additional information
Some more information about this dump:
- 225 articles that are in the main space and not redirects
- 267 articles and redirects in the main space
- 782 pages in all namespaces
- 80 redirects in all namespaces
- 2433 external links in every namespace
- 590 external links in the main space
[edit] Very probable spambot pages
If index.php is found in a page title, it is very likely the article talk page has been created by a spambot. These pages should be deleted and protected if possible.
| Article ID | Article |
|---|---|
| 2666 | W/index.php |
| 2687 | MediaWiki talk:Ipb cant unblock/w/index.php |
[edit] Possible spambot pages
Possible pages created by spambots ending with /.
| Article ID | Article |
|---|---|
| 840 | Wikipedia:Broken/ |
SELECT page_id, page_title, page_namespace FROM page WHERE page_title LIKE '%index.php%' OR page_title LIKE '%/wiki/%' OR page_title LIKE '%/w/%' OR page_title LIKE '%/';

