Connectivity in Wikipedia


I’ve posted some info about my works on Connectivity in Wikipedia recently. So, Connectivity Project was started to connect all articles in Wikipedia. XKCD comics illustrates this in almost ideal fashion.

 

The Problem with Wikipedia, XKCD comics, © Randall Munroe, CC-BY-NC 2.5

 

You see that starting your surfing through Wikipedia from an article about, say, diesel engine, you can end up reading about black hole information paradox (which is very fascinating as for me).

Connectivity is not a problem in English Wikipedia as well. Although, in developing Wikipedias it is. For instance, in my home Belarusian (Taraškievica) Wikipedia there are some articles that are linked from almost no articles at all. Some articles don’t have links to other articles as well. All these moments create one big Connectivity problem, when you can’t get from one topic to another.

Connectivity Project, raised in Russian Wikipedia first, serves as an opportunity to connect articles in one big mesh. It performs advanced SQL queries in Wikimedia Toolserver databases to detect isolated articles of many sorts: orphans (non-linked articles), non-categorized articles and dead-end articles (ones without links to other articles).

Although it’s a huge amount of work, it’s still only a half of all business. Wikipedia editors need to know which article they should fix. That’s why numerous templates exist for article tagging. For this moment, it’s done in semi-automatic way: an on-duty-for-connectivity editor gets a pack of files for AWB (look the link) and processes one file at the time.

The most important aim now is to fully automate the process of tagging/untagging articles. That’s what I’m doing now. One day the bot I’m coding should set and unset all templates on his own. I’ve already tested it (with i18n features enabled) and it seems to work.

,\\// Live long and prosper.

About wizardist

An overrated person.
This entry was posted in Interests, Routine, Technologies and tagged , . Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s