(Please read part 1 from last night first; I’m just picking up where I left off.)
Another proposal that would certainly reduce vandalism of Wikipedia articles would be to eliminate editing by unregistered users, either throughout Wikipedia or at least on BLPs. Presently, “anyone can edit” extends even to users who haven’t registered an account. In wiki parlance, unregistered users are referred to as “IP” editors, because in the article contribution histories, the IP number of the computer from which they edited is displayed instead of their username. This form of “anonymous editing” should not be confused with a different sort of anonymity, which allows users to register under pseudonyms without providing their real names.
The main value of allowing IP editing is that it gives brand-new users the ability to try out “anyone can edit” for themselves, without taking the time and trouble to register. Many new users make their first edits as IPs, often after spotting a typo in an article or noting that some information is missing, and there is a fear that if registration were required to edit, some proportion of first-timers wouldn’t bother, and therefore would never develop the habit of contributing and become “Wikipedians.” For example, this is precisely how I got started in editing, as I mentioned the other night.
While IPs contribute many good-faith edits and some become regular contributors, IP editors are also responsible for much of the drive-by vandalism — often, but by no means always, committed by bored schoolchildren — that afflicts many pages (and gives other editors the opportunity to earn credentials as “vandalism fighters”). The ratio between valid and vandalistic edits by IPs is sufficiently low that from time to time there is discussion of requiring registration to edit. A significant step in that direction was taken in 2006, when users were required to register before creating a new page (as opposed to editing an old one).
An intermediate step would be to disallow IP editing just on BLPs. Administrators have the ability to “semiprotect” any page of Wikipedia. A semiprotected page cannot be edited by IPs or by newly registered editors. (A “full protected” page cannot be edited by anyone, except for administrators under very specific guidelines.) Pages are semiprotected usually when they are being vandalized by IPs, typically for short periods by sometimes for a longer term or indefinitely. (For example, [[George W. Bush]] or [[Hillary Clinton]] could probably never be unprotected without being overrun, but those are unusual cases.)
It has been proposed that either all BLPs be permanently semiprotected, or at least that they be liberally semiprotected at a lower threshold of vandalism or at the subjects’ requests. This would certainly reduce the amount of vandalism and defamation from non-registered IPs. (An objection is that it would also eliminate the ability of an unregistered editor, perhaps the article subject himself or herself, to fix vandalism or remove defamation. I don’t know how often this happens.)
The most recently proposed approach for reducing BLP violations and other types of bad edits is called “flagged revisions.” The idea of giving this approach at least a trial was supported by a majority of English Wikipedia editors who participated in a recent poll, and it has already been implemented on the German Wikipedia. There are various somewhat different proposals for how this could be done, either on all articles, or on BLP articles, or some subset of them. In general terms, flagged revisions means that anyone can still edit an article — but the edit does not become visible to readers until another editor has reviewed and approved it. It introduces some level of quality control; it also, some say, represents a step away from “anyone can edit.”
This procedure itself raises some questions of implementation. Some are mechanical, such as, what happens when User:B edits the same sentence that User:A has just edited, but before the edit has been flagged? Others are more substantive, such as who gets to be an edit-flagger, and what standards do they use in flagging? If a flagger sees that someone wants to edit Jones’s biography by adding “Jones is a jerk!” then he or she will disapprove the edit — but that’s not really the type of edit that, if a few people see it before it gets reverted, will really damage Jones’s reputation (though it will damage Wikipedia’s). The more subtle defamations may never be recognized by a reviewer who is intelligent and dedicated but unfamiliar with Jones’s life and work — and so they will still make it into the articles — only now they would come with an “approved by an official revision flagger” seal of approval.
The English Wikipedia is struggling with whether to take a step toward flagged revisions. Proponents suggest that it's a long overdue necessary step to address an obvious fault with the site; opponents suggest it would be the death-knell of the "anyone can edit" philosophy that attracts people to contribute. A threshold issue is there is no clear governance process on the English Wikipedia for issues like this, so no one even knows just how the decision will be made. (I'll talk more about governance in a day or two.)
Incidentally, because the issue has come up in the comments, there have been relatively few lawsuits brought by individuals claiming to have been defamed on Wikipedia. To the best of my knowledge, there have been no successful defamation suits against the Wikimedia Foundation, which is the not-for-profit foundation (formerly headquartered in Florida and currently in California) that owns the hardware on which Wikipedia’s and its sister projects’ data reside and the Wikipedia trademark.
In very general terms, the Foundation’s position has been that because it does not create or control the specific contents of any particular page, it is shielded from liability for defamatary content contributed by any user pursuant to Section 230 of the Communications Act (47 U.S.C. § 230(c)), which provides that “[n]o provider or user of an interactive computer service shall be treated as the publisher or speaker of any information provided by another information content provider.”
I know of no reported cases applying Section 230 to a claim against the Wikimedia Foundation. There is one unreported case, Bauer v. Glatzer in the Superior Court of New Jersey, which upheld the Foundation’s immunity. A leading case discussing Section 230 more generally is cf. Barrett v. Rosenthal, 40 Cal. 4th 33, 146 P.3d 510, 51 Cal. Rptr. 3d 55 (2006), while an interesting law review article analyzing the application of Section 230 to Wikipedia is Ken S. Myers, Wikimmunity: Fitting the Communications Decency Act to Wikipedia, 20 Harv. J. L. & Tech. 162 (2006).
I wish very much that I were ending this post with a brilliant solution to problematic content regarding living persons on Wikipedia, but I don’t have one, even after having thought about this matter from lots of angles for close to three years. One of the reasons I asked Eugene if I could post here was to see what the readers here — legally and technically savvy, but without a vested interest in how the issue is addressed — might have to say about these issues. I'll move on to other topics in the next few days, but I'll continue reading the comments here. I'll do my best to respond to some of them before my blogging stint here is up.
Related Posts (on one page):
stupidto figure it outFor example, take biographies of the 18th President, U.S. Grant. Some are laudatory, some are contemptuous, some are accurate, some are garbage. But note that all are signed, and all prosper or founder based on sales, which are a reflection of a combination of the author's or authors' prior reputation and their utility to customers, integrated over some characteristic time scale (years to decades). If you write an inflammatory book filled with half-truths, you may achieve remarkable sales the first year, but you will inevitably not be writing a second edition, and, furthermore, your next book won't be nearly as well received initially, because your reputation will precede you.
Let people create BLPs at will. But let each be created by a specific individual or team, who must sign them. Whoever starts the article signs it, and can decide whether or not to let other people join the team, who must also sign it, but who can opt to abandon the team and delete their name at any time.
If anyone else wants to start a different page, on the same subject, let them, with the same rules. Have one of your "disambiguation" or navigation pages that links them all together, so people can see the available selection, all in one place. (You can even require that when more than one page is available for, say, George Bush, each George Bush page have navigation pointers at the bottom to all other George Bush page. Perhaps the person creating the new page is required to add in his own description of why he's creating a separate page. George Bush -- The Real Story or George Bush, Not Leftwing Hysterical Cant und so weiter. There's no percentage in duplicating The Real Truth ad infinitum, of course, because you won't stand out. If you want traffic, you'll need to give a reasonably accurate definition of what makes your page unique.
Now you just wait. Watch the traffic, if you wish. You will find, inevitably, that some pages and some authors acquire more and more traffic, steady traffic, over time, while others may be momentarily interesting, but quickly fall into disuse. That tells you what your community verdict is. Based on that, if you wish, you can delete the least trafficked pages -- although I would suggest instead merely inserting the relevant statistics into your navigation page, so new surfers know which version of the pages is heavily used. Folks who rely on the wisdom of crowds will go right to the most popular pages, while iconoclasts will presumably visit the outliers.
You can also use this to vet the credibility and authority of the authors, if you like. Someone who consistently builds pages with large and sustained viewerships, and few competing pages, is someone to trust -- or at least, whom the community trusts -- and can be granted greater privileges, perhaps.
This is the ancient system worked out after Gutenberg invented that newfangled thingy where you can duplicate words on paper very fast, after all. Has worked quite well for the past five centuries. In a modified form, it's even worked well in the creation of open software.
I've always wondered why Wikipedia didn't do something as obvious as this, more or less follow the book, newspaper, and blog model -- multiple versions competing in a free marketplace of ideas. Indeed, the fact that it so strenuously resists having an open competition between narratives, by insisting that there be One Correct Dogma on each substantive question, has done a fair amount to reduce my opinion of its goals and methods.
I think that's a great idea, but it won't ever happen. Some present administrators would have to allow the idea to percolate that there is no such thing as "consensus" on their pet subjects.
Of course, living people who actually do things which are stupid, shameful, or just plain evil will generally not be content with biographies of themselves which allow the reader to draw that conclusion. They may initiate libel suits, but fortunately in the United States truth is a defense to claims of libel.
Omitted truth can be just as much a lie as included falsehoods.
So, I'll just add a couple points:
1. Anyone with a few hundred dollars or less to spend per month can have an endless variety of sockpuppets. Simply subscribe to AOL plus a dozen other dynamic IP ISPs, and you'll get a new IP each time. For especially interesting edits, call an out-of-town POP. I'd imagine that's how the smart folks do it. You could be making changes to WP all day long using a few dozen different identities with no way to trace you unless you did something illegal. So, for those who are committed to getting the information they want into top search results - such as political candidates, corporations, etc. - the issue of IPs and so on is just a red herring.
2. One workable way to do things would be to require positive ID of those who want to edit certain articles. They can be prominently identified as the authors of the article, with perhaps a section that only they can edit.
Please see my previous comments in the earlier entries (listed at the end of this post).
The judge rejected the request, on grounds that:
* The plaintiffs had not demonstrated that they had informed the Foundation of the allegedly illegal material. (They claimed they had sent an email but provided no indication that it was ever received.)
* Even if that email had been received, it would not have conveyed enough proof to the hosting provider that the material was manifestly illegal (hosting providers are forced to withdraw material only if it is blatantly illegal, e.g. child pornography; for material to be blatantly illegal because of libel, it probably takes a court order).
* When informed of the court summons, the Foundation, hosting provider, quickly removed the offending content.
In short, the Foundation was shielded by its status as a hosting provider with no editorial role/
I advocate signed - digitally signed - wikipedia pages, consortia of contributors to individual articles led by an editor, and multiple articles per topic.
Imagine the disambiguation page allowed reviews of articles, similar to product reviews on amazon?
(I also advocate digital signing of online newspapers (of which there are none, as far as I can tell), mostly for the value of non-repudiation).
I am not retired, and still working at a pace that prevents me from doing some of the fun things (like edit Wikipedia) I'd like to do. I hear people claim that retirees find themselves with too little to do - at the moment, I'm jealous, but understand it could be a real concern. I wonder if Wikipedia should actively court retired people, many of whom still know how to compose an English sentence.) Perhaps this is already happening, and I haven't seen it, but it is one reasons I'd be interested in the demographics. What proportion of Wikipedia editors are retired, and would it be useful to actively court retirees?
Splunge's proposal is brilliant, and exactly what is needed to produce an online encyclopedia that is truly useful. As it is, wikipedia presents a "one true story" approach to every distinct topic or person - which, as is obvious to many, is useless, or worse, harmful.
Splunge's proposal is idiotic, and is exactly what is needed to completely eviscerate the entire idea of having an encyclopedia. As it is, wikipedia presents a "one true story" approach to every distinct topic or person - which, as is obvious to many, is useful, or better, makes it easy to quickly research a topic, which is why we have encyclopedias (secondary sources) in the first place. For those who want more authoritative findings, please consult a primary source, as I am sure you were taught in middle school. If you would like a multiplicity of opinions, there is this wonderful thing called the "intertubez" that is searchable by "teh googlez" so you can read the "one true story" about Bill Ayers and Jeremiah Wright at 24aheaddotcom's website if you don't find enough at wikipedia.
You've basically just described Google Knols. It's a fundamentally different approach. One of the problems with it is that the good gets crowded out by the self-serving and spammy.
A far greater and insidious problem is the enormous POV pushing by leftist editors. One example is the anti-Americanism that is manifested in anything related to American foreign policy. The most egregious case of this is the systematic advocacy of the theory of anthropogenic global warming. The entire topic of global warming is subject to the Group Ownership of William M. Connolley, Kim D. Petersen, Raymond Arritt, Stephan Schulz and some others. Some of these folks are administrators and systematically revert and suppress any contrary or dissenting viewpoint.
Until the leftist cabals are broken up, there is little point in going after anonymous editors who mostly have nuisance value.
Perhaps a comment thread or some way to add notes next to paragraphs? So that you read a biography of someone who is alive and when you get to a controversial paragraph, you see a link to the comments/discussion about that paragraph. Click on it and you get to see the vitriolic debate/argument. Don't click on it if you don't want to see it.
I'd also like to see Wikipedia incorporate some kind of rating system, though I'm not sure how one would do that. Rating systems are subject to all kinds of abuse by the same people who seem to have the free time to engage in edit wars. Rating systems on editors and/or edits could conceivably be helpful.
It is interesting to note that much of what made printed encyclopedias so important is immaterial with the availability of the web.
It is further interesting to note that Wikipedia defines encyclopedia in its own image, in particular, with this:
"Encyclopedias are divided into articles with one article on each subject covered. "
Which is not only not true for encyclopedias in general, but not even for Wikipedia; it may, however reflect a policy or philosophy of those "running" Wikipedia.
There is nothing anathema to encyclopedias in having multiple articles on the same topic, especially for controversial topics. Indeed, to force a single article may be to obscure the truth.
This may be an astounding concept to you, but as I alluded to in an earlier post, there is this wonderful concept called the "intertubez". On this "intertubez" there are numerous articles on all sorts of subjects. You can find these articles by using, inter alia, things called "search engines". I have heard of one of these "search engines" that is colloquially called "teh googlez" (located on the part of the intertubez where thar be dragonz).
Wikipedia is an attempt to have a user-created encyclopedia (a wiki, or community, created, encyclopedia... if only there was some short hand way of referring to that...). It has been very successful so far. And yet, whenever some fringe or political point of view feels that they haven't had their fair shake, they demand that wikipedia change their rules to better present their point of view, which is often in the minority. To "teach the controversy" if you will.
Here's a newsflash- to make (force... you damned statists) wikipedia to post all the nonsense goes against their whole concept. It would make the process less useful to end users (which of the 98 articles should I look at?) and would end up creating the basic process *that already exists on the intertubez*. If that's what you want- USE TEH GOOGLEZ! That's how it works; more popular sites get ranked higher, and people will go to them. The proposed solution is to make wikipedia = teh googlez, and as we have seen with knols, that is not what people want.
Think about it this way: what if nearly every search had conservapedia as the first or second result?
And, regarding whoever said articles (such as knol-type things) can be voted on probably doesn't have as much experience with how that works. For instance, before the election there were many points where Digg's top lists consisted entirely of anti-McCain/pro-BHO articles from the same sites: HuffPost and so on. Substantial comments pointing out how a Digg is inaccurate are frequently voted down so they aren't visible without a click. That style of voting doesn't work due to the lack of accountability and the use of sockpuppets. For non-WP examples of it not working and a better way to do things, see this.
Their sole purpose has been to generate a semi-annual, semi-idiotic flood of media coverage about the Wouldn't It Be Luverlies of Wikipedia's Good Intentions.
Jon Awbrey
I think your logic is wrong. A Wikipedia article doesn't pop up first on google because it is wikipedia. It pops up first because it has the information that people want. If wikipedia had information that people didn't want, it wouldn't pop up first, and people would not click on it first.
Likewise if conservapedia had information people wanted it would pop up higher. Turning Wikipedia into a google search, won't bring more people to wikipedia. It will just encourage people to go to other links.
A site's overall value is a major part of search engine rankings. A wikipedia page on a particular topic will get a high placement simply due to WP's overall cachet, irrespective of that particular page's individual value.
Wikipedia was one of them.
What do you call it when a human fails the Turing test?
There are many threads discussing these issues at The Wikipedia Review.
I've known a couple people who worked at google as programmers, both told me that internal links are not counted.
I would be mightily e-mused if you could them to put that on the record here.
There isn't space here to discuss all the examples that I and many others have examined over the years, but one thing is abundantly clear -- the Google PageRank has nothing to do with the quality or even the quantity of meaningful content on the page in question.
Here's just one for fun:
Ampheck
WP pages have high rankings because lots of external sites link to wikipedia pages. That helps every wikipedia page regardless of whether external sources link to that page or not.
I would not, however, be surprised if the claim about canonical source is true. That is a different claim than internal links boosting ranks.
That particular search probe was amusing because it shows that Google is too dumb to tell the difference between "Ampheck", a term of art in logic, and eleventy gadshillion variants of "Amp? — Heck!", which is apparently an extremely popular form of e-jaculation on audiophile discussion boards all over the Dubya³.
At any rate it should be clear on intuitive grounds that a PageRank algorithm that simply counts links, whether external or internal to a domain, simply cannot tell whether people are using those links to say positive or negative things about the sites they reference.
Could you explain why Google's official blog dedicated to webmasters says:
"Link architecture — the method of internal linking on your site — is a crucial step in site design if you want your site indexed by search engines."
...and...
"Verify that Googlebot finds your internal links
For verified site owners, Webmaster Tools has the feature 'Links > Pages with internal links' that's great for verifying that Googlebot finds most of the links you'd expect. This is especially useful if your site uses navigation involving JavaScript (which Googlebot doesn't always execute) — you'll want to make sure that Googlebot is finding other internal links as expected."
These were written by Maile Ohye, Developer Programs Tech Lead at Google. What is the name of your source who says that internal links don't assist in search engine rankings?
Look, I run MyWikiBiz.com, which ranks as about the 95,000th most popular website on the Internet. Frankly, that pretty much sucks. But, it provides me the opportunity to experiment with various ways of testing Google PageRank dynamics. I guarantee you, if you gave me two brand new words to "test" Google -- one word, I'd load up with internal links in its definition; the second word, devoid of internal links -- the second word might not even show up on Google, while the first one absolutely will.
I'm sorry to sound rude, Soronel, but you have no idea what you're talking about.
whitewashbiases in the article.Unfortunately, Westlaw won't show the article if you try to find it using the citation in Newyorkbrad's post above due to the incorrect page number.
If you have a comment about spelling, typos, or format errors, please e-mail the poster directly rather than posting a comment.
Comment Policy: We reserve the right to edit or delete comments, and in extreme cases to ban commenters, at our discretion. Comments must be relevant and civil (and, especially, free of name-calling). We think of comment threads like dinner parties at our homes. If you make the party unpleasant for us or for others, we'd rather you went elsewhere. We're happy to see a wide range of viewpoints, but we want all of them to be expressed as politely as possible.
We realize that such a comment policy can never be evenly enforced, because we can't possibly monitor every comment equally well. Hundreds of comments are posted every day here, and we don't read them all. Those we read, we read with different degrees of attention, and in different moods. We try to be fair, but we make no promises.
And remember, it's a big Internet. If you think we were mistaken in removing your post (or, in extreme cases, in removing you) -- or if you prefer a more free-for-all approach -- there are surely plenty of ways you can still get your views out.