Possible error with the bot

I think there's something wrong with CorenSearchBot: it thinks Cassius Clark Thompson House is a copyvio of this page, which doesn't even mention Thompson or his house. Nyttend (talk) 00:17, 2 February 2011 (UTC)

Thanks for the quick help! Nyttend (talk) 17:11, 2 February 2011 (UTC)

History of Edinburgh

Corenbot seemed to have a problem with the first para which appeared on the website noted. I've now removed this, and removed the Corenbot tag. Berek (talk) 22:52, 5 February 2011 (UTC)

Sweet spot (sport)

[1]. The cited ext link clearly refers to wikipedia. Can you make you bot a bit smarter? Loew Galitz (talk) 00:19, 2 February 2011 (UTC)

(talk page stalker) - cut & paste split to disambig with unclear edit summaries - repaired. MLauba (Talk) 11:12, 8 February 2011 (UTC)

Joseph El Khazen

A tag has been inserted in the new Article Joseph El Khazen saying "The CorenSearchBot has performed a web search with the contents of this page, and it appears to include material copied directly from: http://www.khazen.org/genealogical_tree". Surely the CorenSearchBot made a mistake, as only the name of the Patriarch's family can be found in such a page (as well as some other Maronite surnames). Anyway no text has been copied by such site, as clearly anyone can check. Thus I remove the tag.A ntv (talk) 21:48, 7 February 2011 (UTC)

Bot bug

Any reason that it would mark Dangdang as a copyvio of [2] (which isn't even the company I'm writing about, incidentally) other than the fact that the word "Dangdang" appears a lot in both? -- King of ♠ 10:11, 8 February 2011 (UTC)

(talk page stalker) Very short articles have a higher risk of generating false positives. MLauba (Talk) 10:34, 8 February 2011 (UTC)

Penetrant (mechanical, electrical or structural) vs. Penetrating item

Hi. I was trying to delete a redirect from Penetrant (mechanical, electrical or structural) to Penetrating item but had no luck. I tried the undo link numerous times and it did not work. I could not even see a record of my attempt to do so. So I recreated the original and this bot picked up on the fact that there are now 2 identical articles. I'd like to get rid of the Penetrating item one but I don't know how. Please help. Thanks, --Achim (talk) 13:20, 14 February 2011 (UTC)

It's all gone. — Coren (talk) 13:24, 14 February 2011 (UTC)

Very quiet

It's gotten very quiet around here. Thank you for speaking up. Bishonen | talk 23:08, 14 February 2011 (UTC).

I'm... not entirely sure what you're talking about? My talk page tends to be on the quiet side, with most of the chatter related to my bot. Once every so often, dramahz occur; but that's about as reliable as storms and floods (i.e.: unavoidable in the long run but good luck predicting it). — Coren (talk) 00:55, 15 February 2011 (UTC)
I'm taking that as a personal attack. Short Brigade Harvester Boris (talk) 01:24, 15 February 2011 (UTC)
Okay; is there an obvious joke I'm being oblivious to? — Coren (talk) 01:29, 15 February 2011 (UTC)
No. I wasn't talking about your page. This. Bishonen | talk 13:32, 15 February 2011 (UTC).
Oh, that. Well, as far as soundness goes, the whole accusation lies somewhere around the second shooter on the grassy knoll, you know. I don't think anyone was likely to take it seriously, but such screeds are hurtful nonetheless. Besides, that just not the kind of thing you'd do. (I mean, 'zilla might, though there would be no subtle threats or veiled menace — lots of teeth 'tho). — Coren (talk) 00:25, 16 February 2011 (UTC)

The Sin of Nora Moran

Hello! Please review this edit from your bot. Non-creative list of information is not creative content. --Bensin (talk) 14:08, 10 February 2011 (UTC)

It's not, but the bot can't guess whether creativity is involved or not. :-) — Coren (talk) 01:22, 11 February 2011 (UTC)
I guess not. That would be a nice (and probably Nobel-prize-winning) feature :-) But I guess the bot uses some sort of weighing system? Perhaps the bot should be less sensitive to some pages on some sites that contain more data than creative work (for example the cast list pages on IMDb). Or perhaps some sort of system for white-listing pages? --Bensin (talk) 14:27, 11 February 2011 (UTC)
Sites can be whitelisted, but many possible sources of bland lists are also often sources for creative text that must be flagged. IMDb is a good example; many have the bad habit of pasting synopses from there (which are a problem) and casts or filmographies (which aren't). — Coren (talk) 16:12, 11 February 2011 (UTC)
Then it's perhaps possible to whitelist the casts and filmographies section of IMDb. Is it a fair guess that they generate a fair amout of false positives? --Bensin (talk) 16:33, 14 February 2011 (UTC)
Not as much as you'd think: they tend to raise flags when they're the only thing in an article, but even a paragraph or two of proses tends to be enough that the 'bot notes the relative significance.

Part of the problem lies that whitelisting the right bits of IMDb wouldn't help: it wouldn't match the lists there, but things like casts or list of tracks are found all over the 'net and it'd find another copy elsewhere to match instead. — Coren (talk) 16:41, 14 February 2011 (UTC)

There's a reason why I'm pushing this issue. A bot can do a lot of work in a short amout of time. If a bot does something it's not supposed to, like tagging a false positive in the case of your bot, it generates unnecessary work for someone else who has to clean it up. In my case I sifted through copyright policies to find the link about "Non-creative list of information..." above. Could you please take a look at the false positives and see if something can be done to minimize the impact them? --Bensin (talk) 15:30, 16 February 2011 (UTC)

History of numerical weather prediction flagged

Your bot flagged the new article, which took much content from the numerical weather prediction article, which was mirrored on another web site, which caused the bot to flag the page as being copied from said mirrored website. Could you please fix this? Thanks. Thegreatdr (talk) 17:23, 16 February 2011 (UTC)

Geomajas

Corenbot rightfully indicated a similarity with a previously deleted wikipage of the Geomajas Opens Source Project. (As a result) the Geomajas has also been removed again from wikipedia. Although we re-created the new Geomajas page starting from the previous page, it is not a pure copy but rather an adapted version. Biggest difference is that since November 2011 Gemajas has graduated as an OSGeo project. Geomajas is also being referred in at least 2 other GIS related wikipedia articles. So we would like to propose it again as a wikipedia entry. Where do we start best? Thanks for your help! Frankenmaes (talk) 12:09, 8 February 2011 (UTC)

CSB

 
Hello, Coren/Archives/2011. Please check your email; you've got mail!
It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{You've got mail}} or {{ygm}} template.

VernoWhitney (talk) 23:13, 20 February 2011 (UTC)

Uh, it's not copyrighted

Cibecue Community School was flagged by Coren for being a duplicate of some Classmates.com page, which I never consulted in my research. (I consulted BIE sources, the AIA index, and one ADE source.)

Schools like these – without a website and with little information on the BIE page – are very hard to write about. Greyhills Academy High School was like that too. Raymie (tc) 03:20, 22 February 2011 (UTC)

lesbian (band)

I received a note that this page is similar to lesbianwitch.com. when I tried to visit that site I found that it was not in english. If there are similarities, it is a remarkable coincidence. — Preceding unsigned comment added by Lesbroham (talkcontribs) 19:54, 24 February 2011 (UTC)

Wikipedia mirror

Just declined a speedy based on CSBot's comparison of a page with wikibin.org, which appears to be a mirror/fork. Thought I'd best drop you a note :) Regards, - Jarry1250 [Who? Discuss.] 19:41, 25 February 2011 (UTC)

Corenbot seeing copyright violations Postage stamps and postal history of Sint Maarten

Corenbot just suspected a copyright violation, where actually the site it pointed to had copied from wikipedia. I remember the last time I started an article from scratch months ago it did exactly the same (United States Permanent Representative to the Organisation for the Prohibition of Chemical Weapons). Some adjustment might be in order; or is there something I do that triggers this bot? L.tak (talk) 19:09, 26 February 2011 (UTC)

I have been thinking about what I did that might have triggered Corenbot. But the question possibly is more: what triggered those wp-mirrors to be so fast. I added the article complete with templates and categories. And that might have been the reason it was picked up within the second by the mirror. What do you think? L.tak (talk) 22:40, 26 February 2011 (UTC)

Arbcom-bashing..

..is all very well in its place, but the Rodhullandemu problem is not that place. I agree with what you say here. Bishonen | talk 20:53, 26 February 2011 (UTC).

Archelaus (father of Archelaus of Cappadocia) article

Hi there

This is regarding the above named article I have just added on Wikipedia.

I have not broken any copyright rules!

This article is about a Greek nobleman that lived in the 1st century BC and was a High Priest Ruler of the temple state of Comana Cappadocia.

If you have a look at the sources mentioned in the article, there is no mentioned of this website of Archelaus' cards.

I have never seen this website in my life. You may have mistaken this website to another website. Please if you can tag off this article from being deleted.

From

Anriz. — Preceding unsigned comment added by Anriz (talkcontribs) 07:01, 23 February 2011 (UTC)

pass-out teh baked goodies

— Preceding unsigned comment added by Gold Hat (talkcontribs) 13:15, 26 February 2011 (UTC)