Overview

The list below shows deleted main-namespace pages that have not been recreated, ranked by the number of incoming links from the main namespace. Only the top 250 entries (of approximately 80,000) are shown. It was generated on Wikimedia Labs by TB (talk) 07:54, 20 May 2015 (UTC)

Relevant policy/guidelines

  • (A) - Wikipedia:Red_link#Dealing with existing red links - "a plausible red link might be to driving in Bangladesh, since an article on driving in the United States exists, and country-specific driving articles like these are a likely area for future creation"
  • (B) - Wikipedia:Deletion policy#Copyright violations - deletion of an article for copyright violation does not attest to the un-notability of that subject.
  • (C) - Article deleted as spam/gibberish with no comment as to its notability. It would seem that an article being 'so bad that deletion is a net gain for Wikipedia' is not as high a bar as one would think.
  • (D) - Article deleted (AfD, PROD or CSD) with prejudice.
  • (E) - Article deleted (AfD, PROD or CSD) without prejudice.

Report

Regenerating

-- Find all deleted page titles with visible old revisions

CREATE TABLE deletedpages AS
SELECT DISTINCT ar_namespace AS dp_namespace, ar_title AS dp_title
FROM enwiki_p.archive
WHERE ar_deleted = 0;

-- Remove all titles that still exist or have been recreated

DELETE d
FROM deletedpages d
INNER JOIN enwiki_p.page p
        ON p.page_namespace = d.dp_namespace
       AND p.page_title = d.dp_title;

-- Count the number of incoming links to each remaining title from namespace 0

ALTER TABLE deletedpages ADD COLUMN ( c int(5) NULL );

-- Each run fills in at most 300,000 rows; rerun until no rows with c IS NULL remain

UPDATE deletedpages
SET c = ( SELECT count(*)
          FROM enwiki_p.pagelinks
          INNER JOIN enwiki_p.page ON pl_from = page_id
          WHERE page_namespace = 0
          AND pl_namespace = dp_namespace
          AND pl_title = dp_title )
WHERE c IS NULL
LIMIT 300000;


-- Output the top 250 main-namespace titles as wikitext list items

SELECT CONCAT( '*[[', dp_title, ']] - ', c, ' links' )
FROM deletedpages
WHERE dp_namespace = 0
ORDER BY c DESC
LIMIT 250;
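
The counting and output steps above could possibly be combined into a single query, avoiding the extra column and the batched UPDATE. The following is only a sketch: it assumes the deletedpages table has already been created and filtered as above, and that pagelinks still exposes pl_namespace and pl_title as it did when this report was generated. The aliases (d, pl, p, report_line) are illustrative, and the query has not been timed against the replicas.

```sql
-- Sketch: count main-namespace incoming links per deleted title in one pass.
-- Assumes deletedpages was built and pruned by the earlier steps.
SELECT CONCAT( '*[[', d.dp_title, ']] - ', COUNT(*), ' links' ) AS report_line
FROM deletedpages d
INNER JOIN enwiki_p.pagelinks pl
        ON pl.pl_namespace = d.dp_namespace
       AND pl.pl_title = d.dp_title
INNER JOIN enwiki_p.page p
        ON p.page_id = pl.pl_from
WHERE d.dp_namespace = 0      -- only deleted titles in the main namespace
  AND p.page_namespace = 0    -- only links coming from main-namespace pages
GROUP BY d.dp_title
ORDER BY COUNT(*) DESC
LIMIT 250;
```

Because of the inner joins, titles with no incoming links drop out of the result rather than appearing with a count of 0, which does not affect a top-250 list.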