Wikipedia:Bots/Requests for approval/GreenC bot 6
- The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Withdrawn by operator.
Operator: GreenC
Time filed: 15:38, Friday, July 20, 2018 (UTC)
Automatic, Supervised, or Manual: Automatic
Programming language(s): BotWikiAwk
Source code available: Yes
Function overview: Convert New York Times archives from old to new format. Example.
Links to relevant discussions (where appropriate): Wikipedia:Bot_requests#New_York_Times_archives_moved
Edit period(s): one-time
Estimated number of pages affected: 29,707 links (fewer pages)
Namespace(s): Mainspace
Exclusion compliant (Yes/No): yes
Function details: The New York Times is an important citation source on Wikipedia. Keeping the archive URLs up to date will prevent link rot if or when the redirects stop working. The new URL is more informative: it carries date information and the file type (PDF in the example), and the latter affects the display output of CS1|2 templates.
The bot works by requesting the HTTP headers of the old URL, reading the Location: header of the redirect, testing that the target works, and then replacing the old URL in the wikitext. It will leave any web-archived URLs as-is, e.g. NYT links archived at the Wayback Machine.
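The steps described above can be sketched roughly as follows. This is an illustrative Python sketch, not the bot's actual BotWikiAwk source. The function names (`http_head`, `resolve_redirect`, `update_wikitext`) are hypothetical. It also assumes that the Location: header holds an absolute URL, and that an archived copy can be recognised because the old URL is embedded after a `/` inside the archive URL.

```python
import http.client
import re
from typing import Callable, Dict, Optional, Tuple
from urllib.parse import urlsplit

# A "head" function returns (status code, lower-cased response headers).
HeadFn = Callable[[str], Tuple[int, Dict[str, str]]]

REDIRECT_CODES = (301, 302, 303, 307, 308)


def http_head(url: str) -> Tuple[int, Dict[str, str]]:
    """Send one HEAD request; http.client does not follow redirects."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=10)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=10)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path)
        resp = conn.getresponse()
        return resp.status, {k.lower(): v for k, v in resp.getheaders()}
    finally:
        conn.close()


def resolve_redirect(old_url: str, head: HeadFn = http_head) -> Optional[str]:
    """Return the redirect target if it exists and answers 200, else None."""
    status, headers = head(old_url)
    if status not in REDIRECT_CODES:
        return None
    new_url = headers.get("location")
    if not new_url:
        return None
    # Test that the new URL actually works before trusting it.
    check_status, _ = head(new_url)
    return new_url if check_status == 200 else None


def update_wikitext(wikitext: str, old_url: str, head: HeadFn = http_head) -> str:
    """Replace bare occurrences of old_url; leave archived copies untouched."""
    new_url = resolve_redirect(old_url, head)
    if new_url is None:
        return wikitext
    # (?<!/) skips copies embedded in an archive URL (assumption: those are
    # preceded by "/"); the lookahead avoids matching a longer URL.
    pattern = re.compile(r"(?<!/)" + re.escape(old_url) + r"(?=[\s|\]}<]|$)")
    return pattern.sub(lambda m: new_url, wikitext)
```

Passing the header lookup in as a parameter lets the replacement logic be exercised with a stub instead of live requests, which is one way to dry-run a change before any trial edits.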
Discussion
- Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Please report back here when the trial is done, and include the diff range. — xaosflux Talk 16:08, 20 July 2018 (UTC)
- User:Xaosflux, looking more closely at the data, I made a mistake. There are not 29,707 links; the number is closer to 200. Most of the links in query.nytimes.com are for a different type of page ([1]), not for timesmachine.nytimes.com. There are also special cases that make the bot more complex than I realized. And I'm now too unsure of how the Times has organized its site to confidently change the URLs to the redirects. Probably best to close this out for now until it's clearer what should be done. -- GreenC 01:10, 21 July 2018 (UTC)
- Withdrawn by operator, per GreenC above. — xaosflux Talk 01:53, 21 July 2018 (UTC)
- The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.