Building the bots that keep Wikipedia fresh – GCN.com
Building the bots that keep Wikipedia fresh
While we can all learn from Wikipedias 40 million articles, government bot builders specifically can get a significant education by studying the creation, vetting and roles of the 1,601 bots that help maintain the site and interact with its more than 137,000 human editors.
Researchers at Stevens Institute of Technology classified the Wikipedia bots into nine roles and 25 associated functions with the goal of understanding what bots do now and what they might do in the future. Jeffrey Nickerson, professor and associate dean of research at Stevens School of Business, and an author of The Roles Bots Play in Wikipedia, published in November 2019, likened the classification to the way humans talk about occupations and professions, the skills required to do them and the tasks that must be performed.
Each bot performs a unique job: some generate articles based on templates; some fix typos, spelling mistakes and errors in links; some identify spam, vandals or policy violations; some interact with humans by greeting newcomers, sending notifications or providing suggestions.
The nine main roles account for about 10% of all activity on the site and up to 88% of activity on subsections, such as the Wikidata platform, where more than 1,200 fixer bots have made a total of more than 80 million edits, according to the report.
Anyone can build a bot -- an automated, artificial intelligence-powered software tool -- for use in Wikipedia, but before its deployed, it needs the blessing of the Bot Approval Group. Members determine what the bot will do and which pages it will touch, and they review a trial run of the bot on sample data. That may be all that's required, or the group may also ask to check the source code, Nickerson said. That entire process is public.
Its a good place to start [for bot builders] because you can actually see it, Nickerson said. You can see the bots that are successful, and you can see the conversations take place there, and you can see the way the developers of the bots actually talk to the end users.
Builders consider risks and advantages of their bots, what functions they will start with and which features will come later, and how their bot might interact with others that perform similar functions, for example, he said.
Theres this vetting of the bot, Nickerson said. If the bot is going to do something fairly minor and not on very many pages, there may be less vetting than if the bot is going to create a whole bunch of new pages or is going to do a lot edits.
Another feature of the Wikipedia bots is how they work with human editors. Often, editors create a bot to automate some of their editing processes, Nickerson said. Once they build it, they set it loose and check on it periodically. That frees the editors to do the work that most interests them, but they also become bot maintainers.
The subsection of Wikipedia called Wikidata, a collaboratively editedknowledge baseof open source data, is especially bot-intensive. The platform is a knowledge graph, meaning that every piece of knowledge has a little fact involved and because of the way these are hooked together, the value of it can be a link to another fact, and essentially it forms a very, very large graph, Nickerson said.
Wikidatas factual information is used in knowledge production in Wikipedia articles, thanks to adviser and fixer bots. For example, when theres an election, the results will populate in Wikidata, and pages about a citys government will automatically update the name of the mayor by extracting election information from Wikidata.
Bots interaction with human editors are critical to the success of a website based on knowledge production. On Wikipedia, if someone makes an incorrect edit, a bot may reverse that change and explain what was wrong. Being corrected by a machine can be unpleasant, Nickerson said, but bots can also be diplomatic.
The researchers call these first- and second-order effects. The former are the knowledge artifacts the bots help protect or create, while the latter are the reactions they bring out in humans.
They can actually pay attention to what people are interested in, he said. They can be patient. They can direct somebody toward a page that they know with high probability is going to be the kind of page where that person can actually make an important contribution. The instinct of some people is to go to the pages that are actually very highly edited and very mature and try to make changes to those pages, and thats actually not the right place to start. The place to start is with a page that is newer and needs a particular kind of expertise.
When human editors have a positive interaction with bots right out of the gate, that helps with the cultural aspect of bot building. It also provides insight into what makes a bot successful -- a topic Nickerson plans to study more in the future.
Researchers at MIT, meanwhile, have developed a system to further automate the work done by Wikipedias human editors. Rather than editors crafting updates, a text generating system would take unstructured information and rewrite the entry in a humanlike fashion.
Unlike the rules-based bots on the site, MITs bot takes as input an outdated sentence from a Wikipedia article, plus a separate claim sentence that contains the updated and conflicting information, according to a report in MIT News. The system updates the facts but maintains the existing style and grammar. Thats an easy task for humans, but a novel one in machine learning, it added.
About the Author
Stephanie Kanowitz is a freelance writer based in northern Virginia.
Read the rest here:
Building the bots that keep Wikipedia fresh - GCN.com
- What were the most popular Wikipedia pages of 2024? - Roanoke Times - December 22nd, 2024 [December 22nd, 2024]
- What we learned from Open AI whistleblower Suchir Balaji's Wikipedia Page - The Times of India - December 18th, 2024 [December 18th, 2024]
- From an old version of the Wikipedia page for Warren G and N... - kottke.org - December 18th, 2024 [December 18th, 2024]
- What were the most popular Wikipedia pages of 2024? - WCF Courier - December 18th, 2024 [December 18th, 2024]
- Encyclopedia of the Future: Why is Wikipedia Best Research Option? - Analytics Insight - December 18th, 2024 [December 18th, 2024]
- Wikipedia's Most-Viewed Articles of 2024: Politics, Football, and...Death? - PCMag Middle East - December 18th, 2024 [December 18th, 2024]
- Taxiride Fallout Continues Over Alleged Amendments To Band Wikipedia Page - The Music - December 18th, 2024 [December 18th, 2024]
- Delhi High Court to examine Caravan, Ken articles to decide interim relief in ANI vs Wikipedia - Bar & Bench - Indian Legal News - December 18th, 2024 [December 18th, 2024]
- Boriswave Wikipedia page set up in reference to immigration surge under ex-PM - The London Economic - December 18th, 2024 [December 18th, 2024]
- Wikipedia suspends pro-Palestine editors coordinating efforts behind the scenes - The Jerusalem Post - December 14th, 2024 [December 14th, 2024]
- Wikipedia's 7-year yogurt spelling war was longer than three Shakespeare plays - Boing Boing - December 14th, 2024 [December 14th, 2024]
- Wikipedia boyfriends on celebrating their mundane, anti-online corner of the internet - British GQ - December 14th, 2024 [December 14th, 2024]
- What were the most popular Wikipedia pages of 2024? - York News-Times - December 14th, 2024 [December 14th, 2024]
- Wikipedia's Most-Viewed Articles of 2024: Politics, Football, and...Death? - PCMag UK - December 14th, 2024 [December 14th, 2024]
- What were the most popular Wikipedia pages of 2024? - Martinsville Bulletin - December 14th, 2024 [December 14th, 2024]
- Death most popular thing on Wikipedia, again - Boing Boing - December 5th, 2024 [December 5th, 2024]
- Heres the top 25 list of most-viewed Wikipedia articles of 2024 - KXAN.com - December 5th, 2024 [December 5th, 2024]
- Here Are the Top 25 Wikipedia Searches for 2024 And #1 is BLEAK - Mediaite - December 5th, 2024 [December 5th, 2024]
- Morrissey hits out at Wikipedia for failing to set the record straight - The Independent - December 5th, 2024 [December 5th, 2024]
- Jimmy Wales on Why Wikipedia Is Still So Good - New York Magazine - December 5th, 2024 [December 5th, 2024]
- Here Are The 5 Most Read Wikipedia Pages In 2024 - The Spun - December 5th, 2024 [December 5th, 2024]
- Wikipedia reveals its most searched posts - 97.1 The Ticket - December 5th, 2024 [December 5th, 2024]
- Wikipedia just revealed what weve all been obsessing over in 2024 - Sherwood News - December 5th, 2024 [December 5th, 2024]
- The Terrible Towel Wikipedia page is a must-read yinzer masterpiece - PGH City Paper - December 5th, 2024 [December 5th, 2024]
- The Most Popular Wikipedia Pages Of The Year - iHeart - December 5th, 2024 [December 5th, 2024]
- Neither Donald Trump nor Taylor Swift: This was the most-viewed Wikipedia page in the U.S. in 2024 - AS USA - December 5th, 2024 [December 5th, 2024]
- What were the most popular Wikipedia pages of 2024? - Winona Daily News - December 5th, 2024 [December 5th, 2024]
- Morrissey Mad At Wikipedia, Claims He Was Never In The Nosebleeds Nor Slaughter And The Dogs - Stereogum - December 5th, 2024 [December 5th, 2024]
- Heres the top 25 list of most-viewed Wikipedia articles of 2024 - MSN - December 5th, 2024 [December 5th, 2024]
- The Nosebleeds and Slaughter And The Dogs Band members list explored as Morrissey slams Wikipedia listing - Soap Central - December 5th, 2024 [December 5th, 2024]
- Diddy, Dune, and Donald Trump: The most popular Wikipedia pages of 2024 - STV News - December 5th, 2024 [December 5th, 2024]
- India's bollywood, elections, and IPL among top 10 most viewed articles on Wikipedia - The Tatva - December 5th, 2024 [December 5th, 2024]
- Morrissey says he has no connection with The Nosebleeds and Slaughter And The Dogs, despite claims on Wikipedia - NME - December 5th, 2024 [December 5th, 2024]
- Wikipedia Called To Order By Samson Mow: The Urgency To Invest In Bitcoin - Cointribune EN - December 5th, 2024 [December 5th, 2024]
- Wikipedia and the ANI defamation suit | Explained - The Hindu - December 5th, 2024 [December 5th, 2024]
- A Wikipedia for cells: researchers get an updated look at the Human Cell Atlas, and its remarkable - Nature.com - November 23rd, 2024 [November 23rd, 2024]
- Opinion: Wikipedia has it out for Israel, and weve got the data to prove it - National Post - November 23rd, 2024 [November 23rd, 2024]
- Who edits history? Politics and business in the pages of Wikipedia - EU Reporter - November 23rd, 2024 [November 23rd, 2024]
- What your Wikipedia reading says about you: Study find different styles - The New Daily - November 14th, 2024 [November 14th, 2024]
- Going down a Wikipedia rabbit hole? Science says youre one of these three types - The Conversation - October 26th, 2024 [October 26th, 2024]
- Studying Wikipedia browsing habits to learn how people learn - Penn Today - October 26th, 2024 [October 26th, 2024]
- Portland mayor candidate Rene Gonzalez violated rules by using public funds on Wikipedia page, auditor finds - Oregon Public Broadcasting - October 26th, 2024 [October 26th, 2024]
- Top 5 Editing Conflicts in Wikipedia Pages on Religion - Baptist News Global - October 26th, 2024 [October 26th, 2024]
- Wikipedia editors form urgent task force to combat rampant issues with recent wave of content: 'The entire thing was ... [a] hoax' - Yahoo! Voices - October 26th, 2024 [October 26th, 2024]
- Audit: Rene Gonzalez violated campaign finance law by using city funds to edit Wikipedia page - Fox 12 Oregon - October 26th, 2024 [October 26th, 2024]
- Auditor: Gonzalez violated the law by paying to update his Wikipedia entry - Portland Tribune - October 26th, 2024 [October 26th, 2024]
- Musk Says Wikipedia Controlled By Far-Left Activists, Urges People To Stop Donating To Them! - News24 - October 26th, 2024 [October 26th, 2024]
- Silent Hill 2 Remake Wikipedia page locked after salty fans try to rewrite its critically-acclaimed reception - Eurogamer - October 9th, 2024 [October 9th, 2024]
- The Silent Hill 2 Remakes Wikipedia page briefly got transformed into a phantasmagorical reflection of the psyches of idiots unable to accept reality... - October 9th, 2024 [October 9th, 2024]
- Outrage as Wikipedia changes grooming gangs article to moral panic from the 'Far-Right' - GB News - October 9th, 2024 [October 9th, 2024]
- Silent Hill 2 Falls Victim to Faux Review Bombing on Wikipedia - DualShockers - October 9th, 2024 [October 9th, 2024]
- No, you're not losing it, Silent Hill 2 Remake's Wikipedia page's review scores have been altered, and the site has had to lock it to stop people... - October 9th, 2024 [October 9th, 2024]
- Exploring (and building) the depths of Wikipedia - The Michigan Daily - October 9th, 2024 [October 9th, 2024]
- Wikipedia and Catholicism: Navigating Misinformation and Religious Bias - World Religion News - October 9th, 2024 [October 9th, 2024]
- Weird things are happening on the Silent Hill 2 remake Wikipedia page, as folks sabotage review scores for reasons - Sports Illustrated - October 9th, 2024 [October 9th, 2024]
- Silent Hill 2 Remake Wikipedia Page Locked After Fans Tried to Change Reviews - Rely on Horror - October 9th, 2024 [October 9th, 2024]
- Trolls Edit Silent Hill 2 Remake Wikipedia Page To Lower Its Review Scores - PlayStation Universe - October 9th, 2024 [October 9th, 2024]
- The Kremlin is rewriting Wikipedia - Hindustan Times - October 9th, 2024 [October 9th, 2024]
- Wikipedia Locks Silent Hill 2 Remake Page After It's Spammed With Fake Negative Reviews - TheGamer - October 9th, 2024 [October 9th, 2024]
- Silent Hill 2 remake Wikipedia locked after getting trolled - NME - October 9th, 2024 [October 9th, 2024]
- Wikimedia Technology Summit 2024 brings together tech enthusiasts and developers to bring inclusivity to Wikipedia and Wikimedia projects - Business... - October 9th, 2024 [October 9th, 2024]
- AI's threat to Wikipedia - ABC News - October 9th, 2024 [October 9th, 2024]
- Silent Hill 2 remake page on Wikipedia blocked after fans try to rewrite critics' positive reviews - ITC - October 9th, 2024 [October 9th, 2024]
- Matt Walsh Recalls Critics Trying to Get Him Arrested Using Wikipedia - The Daily Wire - October 4th, 2024 [October 4th, 2024]
- Wikipedia and Religion: Uncovering the Dynamics of Reliable Sources and Digital Bias - Baptist News Global - October 4th, 2024 [October 4th, 2024]
- Wikipedia: Accuracy or Prejudice? Islamophobia in the Web 2.0 Era - World Religion News - October 4th, 2024 [October 4th, 2024]
- Ultrarunner Camille Herron is dumped by Lululemon after her husband edited her rivals' Wikipedia pages to boos - Daily Mail - October 3rd, 2024 [October 3rd, 2024]
- Ultrarunner Camille Herrons Primary Sponsor Drops Her After Wikipedia Scandal - Runner's World - October 3rd, 2024 [October 3rd, 2024]
- Ultrarunner Camille Herron dropped by Lululemon following Wikipedia editing controversy - Runner's World UK - October 3rd, 2024 [October 3rd, 2024]
- Wikipedia relies on army of volunteers as it stares down AI - Devex - October 3rd, 2024 [October 3rd, 2024]
- This Ultramarathon Runner Was Dropped By A Major Sponsor Amid A Wikipedia Editing Scandal - Women's Health - October 3rd, 2024 [October 3rd, 2024]
- Wikipedia scandal: Heres why ultrarunner Camille Herron was dropped by Lululemon - Women's Agenda - October 3rd, 2024 [October 3rd, 2024]
- Guess The Wikipedia Footballer #4: Can you name these 10 footballers that played under Carlo Ancelotti? - Planet Football - October 3rd, 2024 [October 3rd, 2024]
- ANI vs Wikipedia: The free encyclopedias impact on India and more - The Hindu - September 16th, 2024 [September 16th, 2024]
- Wikipedia and AI: Could artificial intelligence kill the online encyclopedia? - Newstalk - September 16th, 2024 [September 16th, 2024]
- Reliable Sources: How Wikipedia Admin David Gerard Launders His Grudges Into the Public Record - World Religion News - August 31st, 2024 [August 31st, 2024]
- Wikipedia and the Digital Services Act: Lessons on the strength of community and the future of internet regulation - Le Taurillon - August 31st, 2024 [August 31st, 2024]
- Depths Of Wikipedia: This Page Is Dedicated To The Weird Side Of Wikipedia (97 New Pics) - AOL - August 31st, 2024 [August 31st, 2024]
- Wikipedia's Longest-Running Hoax Remained Online for Almost 10 Years: The Story of Jar'Edo Wens - The Journal - August 31st, 2024 [August 31st, 2024]
- 40 Times People Found Such Hilarious Gems On Wikipedia, They Just Had To Share (New Pics) - Bored Panda - August 31st, 2024 [August 31st, 2024]