Talk:Artificial intelligence in Wikimedia projects

Undue weight

I am publishing this draft with two sections - AI for Wiki and Wiki for AI. I gave more weight to the AI for Wiki section just because that is the concept which has the attention of the popular press and popular science discussion.

The weight of the sources is not in the popular press, but actually in the academic literature. There are many more sources talking about the use of Wikimedia projects for off-wiki AI projects than there are sources and projects applying AI to develop Wikimedia projects. I wanted to establish this article to be more accessible to the first wave of readers whom I expect to want to read this content and sort their thoughts about it.

Wiki for AI is much bigger now, and for the foreseeable future either Wikipedia content will be the basis of AI research, or AI research will be based on Wikimedia content rebranded and further developed as some next-generation dataset.

I would like to see this article re-written to identify whatever review articles exist and to give fair weight to the various topics which papers most deeply examine. Blue Rasberry (talk) 17:04, 24 August 2018 (UTC)

Wikipedia as a data set

  • Mehdi, Mohamad; Okoli, Chitu; Mesgari, Mostafa; Nielsen, Finn Årup; Lanamäki, Arto (March 2017). "Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus". Information Processing & Management. 53 (2): 505–529. doi:10.1016/j.ipm.2016.07.003.

This article identified 132 other academic articles which describe how they used Wikipedia as a data set underlying other research. As I start this article, there is a subsection on how artificial intelligence projects use Wikimedia content in their development. That subsection could be split off into its own article, and that split article could itself be split into other articles. One such possible article could be something like "Wikipedia as a data set", because this article and the 132 it identifies are reliable source material for developing this as an independent concept. Blue Rasberry (talk) 16:58, 24 August 2018 (UTC)

"Artificial intelligence" as a buzzword

"Artificial intelligence" is a term with many meanings in various fields. Perhaps most of the sources which talk about artificial intelligence in Wikimedia are talking about machine learning, which is often used as another name for artificial intelligence. Blue Rasberry (talk) 17:12, 24 August 2018 (UTC)

Bias study

No research has been reported from this yet, but here is a 2018 research project analyzing bias in Wikimedia data structuring.

The PIs on this, Brent Hecht and Loren Terveen, seem to comment on Wikidata.

Blue Rasberry (talk) 21:28, 3 December 2018 (UTC)

ClueBot NG

Should a section about User:ClueBot NG be part of the article? The userpage states it detects vandalism with Bayesian classifiers, a neural network, and a calculated threshold, but I'm not sure if that is "true" artificial intelligence. 172.112.210.32 (talk) 15:14, 22 December 2021 (UTC)

Is "Detox" real?

Does this actually do anything? I've literally never seen a user blocked, or an edit prevented/flagged, by anything called "Detox". It's currently described as "Detox is a project to prevent users from posting unkind comments in Wikimedia community discussions" -- this doesn't seem true to me. jp×g🗯️ 18:14, 2 February 2024 (UTC)