Plagiarism on Knol

A Call For Action

Discusses the problems of spam and plagiarism on the Google Knol service with particular reference to content copied from Wikipedia, in violation of Wikipedia's GNU Free Documentation License. Argues that a failure to act decisively against plagiarism of web content is a disservice both to the creators of that content as well as to the users of Google Knol. Every plagiarized article that appears on Google Knol dilutes the authority of all content published there.


 Knnc,


1. Knol's spam guard: the user


Google Knol has been beset with spam knols, making it inevitable that Google will either be forced to cancel the service or perform a radical overhaul of its procedures for dealing with what they like to call "low quality content". 

Although, since its inception, Google has had safeguards in place to prevent this scenario, they have had limited effectiveness because of the need for human monitoring of new content -- either on the part of Google employees or on the part of Knol authors and readers. 

Since the project's inception, Google's monitoring of Knol content has been a hands-off affair with minimal oversight of submitted content leading to the current state in which a large database of spam articles has built up. 

This leaves users of the service to fend for themselves when it comes to protecting Knol from spammers.  In this regard, end-users have been both helped and hindered by Knol's protections against spam.


2. Spam & "Flag inappropriate content"


By clicking on the Flag inappropriate content link which exists in the right hand panel of every Knol article, Knol users are able to report spam and other objectionable content to Google just as they might on Craigslist or many other social sites.  Having browsed around Knol for a while, I have witnessed spam articles flagged by users and removed by Google - so it's clear that this sytem works.  However, as the next example illustrates, when it comes to Google's flagging feature, there is a significant distinction between spam -- defined as inappropriate marketing -- and the practice of plagiarism which I would consider equally, if not more, problematic.




3. The NASA wiki


Consider, as Exhibit A, the following knols, which bear remarkable similarity to the Wikipedia article on NASA:

http://knol.google.com/k/darkwood-adrenalin/nasa/27bzio9crr17o/3#

http://knol.google.com/k/pramod-kumar/nasa/376d0pmz95tub/14#

Although my intention is not to single out individual authors for derision, the comments on these articles already indicate that their unattributed borrowing from Wikipedia has not gone unnoticed.

In the next sections I would like to point out why I think this is a problem and what Google Knol can do to combat this phenomenon. What I also want to make clear, however, is that I am not arguing for a return to the world of traditional Copyright - in the sense of "All Rights Reserved" - although all Knol authors have the ability to mark their work "All Rights Reserved" - as I have done with this Knol. What I am arguing instead, is that whatever copyright an author has chosen for his work should be respected and that any system like Knol that deals with content creation should have an adequate system in place to allow for an author's copyright/copyleft to be enforced. First let's consider Copyleft and what it means.

4. Remix and the Creative Commons


Lawrence Lessig has written extensively on the subject of digital copyright and has pointed out the importance of "Remix" a term which started out in the context of songs being modified by different DJs, but which in Lessig's terminology refers to taking cultural artifacts created by someone and spinning them into something entirely new through modification. Lessig gave a talk at Ted on the subject which you may be as inspired by as I was:



The Creative Commons organization, of which Lessig is a founder, has also created a very instructive video on the subject of the Creative Commons license, which allows authors to designate their work as being "remixable" since, by default, every content creator owns a copyright to their work once they have created it. This video is an excellent illustration of their case, using the example of some music by the White Stripes which inspired a remix of its own.



This phenomenon of Remix adds to the richness of culture by layering multiple creative perspectives. Indeed the phenomenon of Remix is evident in numerous articles on Google Knol, such as this tutorial knol which takes Youtube content created by the websites railscasts.com and tekniqal.com and, combining them, the Knol author presents his own perspective on their content. Whether the creators of those videos are aware of this new creation or whether they would approve of the new creation is not clear in this case but, by force of the Digital Millennium Copyright Act, Youtube is required to give content authors a method of appeal and, in the case of Youtube, most reports indicate that they have been extremely accomodating to content owners through their Copyright Infringement Notification appeal form. The challenge lies in striking a balance between protecting copyright and allowing for "Remix". In the case of Wikipedia, as I will explore in a moment, individual authors have control over the content that they supply to Wikipedia. However, "Remix" is also permitted if certain guidelines are followed.


5. The Misunderstood Wikipedia Copyright

Although Wikipedia uses the GNU Free Documentation License by default on all submissions, this does not mean that the content on the site is free for people to copy.  The license allows others to build upon Wikipedia content on condition that they provide attribution to Wikipedia -- a practice that is not followed on the Knol listed above. The full legal implications of the GFDL are outside the scope of this document but Wikimedia's guidelines are fairly clear in their discussion at the following link:

http://commons.wikimedia.org/wiki/Commons:Reusing_content_outside_Wikimedia

The relevant section of the document states:

Re-users are free to make derivative works and copy, distribute, display, and perform the work, even commercially. To comply, (a) release your version under the GFDL, (b) credit at least the five most substantial authors or content creators and (c) include a complete copy of the GFDL. In the case of derivative works you must also include the complete history section.

6. "Similar Content On The Web"


Coming back to the offending Knols discussed earlier, in the right hand panel of the Knol, Google's feature Similar content on the Web indicates when Google is able to identify a similarity between the content of the Knol and other content on the web. This feature is often quite successful in identifying offending Knols but they require user intervention before taking the final step of removing the content because the feature is not foolproof - in some cases the author of the Knol has legitimately published the Knol in two places.

7.  Plagiarism & "Flag inappropriate content"


After clicking on "Flag inappropriate content", you may notice that copyright infringement notifications are absent from the list of options that you can conveniently submit to Google on the following page. 



That is, unless you happen to notice the notation close to the footer of the page which reads as follows:

If you believe this content infringes upon your copyright, please see our instructions for notification of copyright infringement.



The instructions explain that the copyright holder must contact Google in writing rather than through e-mail or any other digital means, significantly complicating the process of reporting these types of violations.

Deciding that I didn't have time to get in touch with all of the contributors to plagiarized Wikipedia pages to ask them to contact Google in writing about a violation of their copyright, I decided to Flag inappropriate content and choose the Other option, which provides a text box where I could enter a note about the violation of the Wikipedia license by the Knol. In the absence of a response, I decided to write this Knol in the hope that other users will call on Google to act.

8.  The need for action


Failing to act decisively against plagiarism of web content is a disservice both to the creators of that content as well as to the users of Google Knol. Every plagiarized article that appears on Google Knol dilutes the authority of everyone's work that is published here.



9. Knol Action Petition


Please submit the URL of a plagiarized Knol that has not been acted on by Google following the submission of a flagging complaint.




Comments

Congratulations, Gold Badge

Dear Jay --

We are very pleased to announce that this Knol is the Gold badge winner for English Knols created in November, 2008. Congratulations. You may view your award at http://knol.google.com/k/peter-baskerville/top-pick-best-knols-of-the-month/14j3i4hyjvi88/60#.

Top writers like you may benefit from participation in the 'Google Knol LinkedIn Group', located at http://www.linkedin.com/groups?gid=2185205&trk=hb_side_g.

Please consider joining with us to add your point of view. Knol is listening!

Great work, keep it up,

Murry Shohat and Peter Baskerville

Oct 15, 2009 3:25 PM
Report abusive comment

hey whats it all about ?

so Do you wish for knol to be so closed as Wikipedia I don't get it can you please describe closer what you mean as "Spam" don't tell me that you want to make this site as Wikipedia which users throw up any materials that dosen't fit their commercial goals (Because the users are not permited to do any research) If you're looking for Plagiarism the Wikipedia is the queen of plagiarism. While Google Knols offer a healthy initiative to find articles with personal easy to understand knowledge and it openess to accept any articles proves to be an advantage for Google cause the facts are written there by specialists (people who know them) which is an advantage to us all. Knol really is an unit of Knowledge fact checked and not spamed or plagiarised !

Last edited Jul 12, 2009 5:48 AM
Report abusive comment

Call for action answered

Jay --
You called for action and a trio of authors has responded: http://knol.google.com/k/peter-baskerville/plagiarism-on-knol-report-it-here/14j3i4hyjvi88/51#.

Your Knol is excellent. We've quoted you in our Knol and included a link to yours. Now that we've had a few weeks experience, we think that Knol users are getting the message. Let's collaborate on this. Would you accept our invitation to join as an author-investigator?

Best regards,
Murry

May 14, 2009 6:08 PM
Report abusive comment

Reporting issues...

A 'report copyvio' button is VITAL. And after a certain number of copyvio issues, users should simply be barred from Knol, period.

Since users can also delete comments or reviews, simply marking a page that was obviously copied directly from Wikipedia with a comment ( like http://knol.google.com/k/-/-/4hmquk6fx4gu/72 ) or making reviews of people who do literally nothing but copy other websites ( like http://knol.google.com/k/the-judge/-/14g5pj4cyuk5u/0# ) isn't enough. Registered users need to be able to report violations anonymously, with nothing but the reporter and the Knol team knowing.

Last edited Feb 3, 2009 9:03 PM
Report abusive comment

It is an issue, It is not an issue

Somebody copying and pasting it in a blog is an issue and is not a big issue. Many times it is easy to spot a plagiarised version, as many odd statements appear in them. I have seen my entire blog being copied by another blog. My blog posts are copied in their entirety. I wrote I told this to my daughter, the sentence was copied verbatim. But do we have the time to go after such people instead of creating material which we feel is useful to people at large. Original authors do not have time to go after those people and technical platform providers also will not have time to go after those people.

I think, knol by providing information where similar content is available, provide a solution to the problem technologically. Even if a writer does not acknowledge the original source, the technical feature of the knol platform acknowledges.

I keep consistently writing in various places that knol authors should not use wikipedia materials. For that matter they should not use materials of any other publishing platform as knol is a competing publishing platform. They may use materials of various business concerns, as the business concern itself will be happy that it gets additional exposure to its area of business. But proper citation and linking is necessary to provide that benefit the business concern involved. Plagiarism is to be fought by a joint action committee of various publishing platforms and technological solutions like the one knol brought out have to be brought into existence.As Chris Anderson, Editor Wired magazine wrote, there is not much money to be made from the best blogs through adsense. Then who is going to make money by plagiarising some articles. One can leave them as nuisance. It is only when anybody is getting a lot of traffic on plagiarised article that one can make an issue of it.

Last edited Feb 3, 2009 7:26 PM
Report abusive comment

Similar content

You say above that the "Similar content" refers to it's Wikipedia alter-ego. This isn't quite true. It refers to any web page, hosted anywhere, that is similar. For example, when I copy some of my own authored pages, from my own web site http://www.countyhistorian.com, I can also get that information flag on my similar knol.

Last edited Feb 3, 2009 10:22 PM
Report abusive comment

Health Fraud and Plagiarism

I had one of the recent knol which had copied data without citing my knols removed under Health Fraud as it was trying to dupe patients with bogus cure. Several other knols under Health Fraud have been removed. Several thousand knols from known and confirmed spammers with Health Fraud are still there? It is an immense task to signal each one of these and inspite of past and recent FDA action and ban. Most of the Akai Berry knols for weight loss are Health Fraud and should be deleted in mass.

Your knol was highlighted?

Michael Can you add "Fraud" in the title. I think knol help is the place to discuss these points.

http://knol.google.com/k/krishan-maggon/health-fraud-knols/3fy5eowy8suq3/26#

http://knol.google.com/k/krishan-maggon/quality-assurance-health-group-knol/3fy5eowy8suq3/29#

Last edited Feb 3, 2009 4:52 AM
Report abusive comment

A page to discuss concerns about knol quality.

See here: http://knol.google.com/k/knol-help/spam-plagiarism-and-low-quality-content/si57lahl1w25/117#

You are welcome to continue your independent critique of Knol quality. We will do what we can to improve.

Cheers.

Dec 11, 2008 1:31 PM
Report abusive comment

Spam, plagiarism, and low quality content in Knol

Hello Jay -

I am preparing a Knol Help page entitled "Spam, plagiarism, and low quality content in Knol", where we will try to respond to the general questions you raise here. On that page I will follow up with the following items:

1) Identify actions users can take
2) Describe some of our work in progress on improving content quality, which you will be welcome to comment on.
3) Provide a place specifically to make suggestions and hold discussions on improving knol content quality.



Last edited Dec 11, 2008 3:26 AM
Report abusive comment
Article rating:
Your rating:
All Rights Reserved.
Version: 95
Versions
Last edited: Feb 27, 2009 6:09 PM.

Knol translations

Activity for this knol

This week:

64pageviews

Totals:

4300pageviews
12comments