Wikipedia talk:Labels/Edit quality

Archive
  • 2015 (Archived 21:11, 23 August 2016 (UTC))

ORES review tool deployed! Now, let's do a new campaign to make the models more accurate. edit

Ping EpochFail (talk · contribs), He7d3r (talk · contribs), とある白い猫 (talk · contribs), Ladsgroup (talk · contribs), Noyster (talk · contribs), EoRdE6 (talk · contribs), SchreiberBike (talk · contribs), JoeSperrazza (talk · contribs), Epicgenius (talk · contribs), Stuartyeates (talk · contribs), MrX (talk · contribs), Jay8g (talk · contribs), Blackmane (talk · contribs), Coretheapple (talk · contribs), Pishcal (talk · contribs), TheMagikCow (talk · contribs), ONUnicorn (talk · contribs), Esquivalience (talk · contribs), Kharkiv07 (talk · contribs), Philippe (WMF) (talk · contribs), Sarr Cat (talk · contribs), Odeesi (talk · contribs) and Masssly (talk · contribs)

Hey folks. We just deployed the mw:ORES review tool on English Wikipedia. Check it out in your Beta features preferences. We've gotten a lot of interest in how to help improve ORES accuracy. The best way to do that is to help us gather more data. So, I've generated a new random sample of revisions to review. See "Edit quality (20k 2016 sample)" at WP:Labels. I've already auto-labeled ~13.7k edits by admins, checkusers, bureaucrats and other high privilege users, so we only have to review the remaining 6.3k edits. Assuming each workset takes 5 minutes (which is how long it took last time), that means we have a total of 10 hours of work. If we can divide that between 23 of us and recruit ~7 more, that means we can all spend less than a half an hour working. I'm planning to put in an hour or so in before the end of the week. --EpochFail (talkcontribs) 21:09, 23 August 2016 (UTC)Reply

Ping Shizhao (talk · contribs), Matma Rex (talk · contribs), Ladsgroup (talk · contribs), Strainu (talk · contribs), TutterMouse (talk · contribs), Brightgalrs (talk · contribs), PabloCastellano (talk · contribs), Alex Cohn (talk · contribs) Elvey (talk · contribs), Townie (talk · contribs), Krinkle (talk · contribs), Jay8g (talk · contribs), Matěj Suchánek (talk · contribs), Bsivko (talk · contribs), Noyster (talk · contribs), BethNaught (talk · contribs), DBrant (WMF) (talk · contribs), Ewulczyn (WMF) (talk · contribs), Blackmane (talk · contribs), My Chemistry romantic (talk · contribs), Sobsz (talk · contribs), QianCheng (talk · contribs), RHo (WMF) (talk · contribs), Gulumeemee (talk · contribs), GrapefruitSculpin (talk · contribs), Jsamwrites (talk · contribs), Chill-- (talk · contribs)
Thanks to all of you for contributing labels to this campaign! We're currently at 1303 out of 6333 labeled edits (20.6%). That's a lot of work. I'm personally really excited about incorporating these labels into ORES training because it should allow us to increase ORES accuracy substantially and to also make sure that ORES keeps up with trends in vandalism. There's 28 of us working on this campaign. If we split the work evenly, we'll only have to label less than 200 more revisions each (that's 4 worksets). I'll be doing my 200 on my lunch break today.  :) --EpochFail (talkcontribs) 16:43, 20 April 2017 (UTC)Reply
For convenience, we're being directed through to this page to find the worksets. Now this page also contains a link to "Discussion quality". Is our input requested on that project as well, and are there any guidance notes?: Noyster (talk), 18:34, 20 April 2017 (UTC)Reply
Good Q Noyster. The "discussion quality" campaign was started by a researcher who does not seem to be active anymore. I'll disable that campaign and ping the researcher to request some documentation. --EpochFail (talkcontribs) 18:45, 20 April 2017 (UTC)Reply

Still including edits in non-article spaces edit

Just one set of 50 included edits in Talk, User, User talk, Draft and Wikipedia spaces. Is this intended?: Noyster (talk), 10:05, 25 August 2016 (UTC)Reply

That's right. This is by design. I was just doing a study of activity on vandalism in User space. See Phab:T141829#2580862. It's important that we train ORES to catch this type of vandalism. --EpochFail (talkcontribs) 17:54, 26 August 2016 (UTC)Reply

Wikipedia style AI evaluation edit

It's critical that the artificial intelligence (AI) models that power Wikipedia's tools are aligned to the community. I'm working with Tzusheng to build a system to evaluate the quality of these AI models used across Wikimedia projects, such as ORES and Liftwing. The system is specifically designed to support wiki-style discussion processes. I need your feedback! If you are interested in testing the system and sharing your feedback, please see m:Research:Community-centered Evaluation of AI Models on Wikipedia/Study Recruitment.

This project is documented at m:Research:Community-centered Evaluation of AI Models on Wikipedia.

Thanks! --EpochFail (talkcontribs) 20:07, 27 June 2023 (UTC)Reply