Once your internet footprint reaches a certain size, chances are people will start scraping your content. Matador contributor Eileen Smith shares a few thoughts on what happened to her.

View from the author’s window on a perfect day.

I WAS pre-coffee tweeting one morning when I saw a tweet on wine tasting in South America, a story I had submitted a few days earlier.

Oh good, I thought, my story is published.

As a freelancer, especially one who writes for the web, even with Google alerts it’s hard to know sometimes when something of yours is going live, and you have to keep your finger on the pulse (or watch your blog traffic) to see what’s up.

Five minutes later, stovetop espresso in hand, I clicked through the link I’d sent my followers. The whole story was scraped. The story that I had pitched, had accepted, researched, and written specifically for publication had been lifted wholesale and placed elsewhere. For free.

Scraping is stealing someone’s content and posting it as your own. In the past I had seen bits and pieces of what looked like my stuff, and even photos I’d taken posted elsewhere. I would write a little, hey, you-know-what email, and usually get some satisfaction, a link at least.

But this? This had my editor messaging me asking if I’d double-submitted, a major no-no in this incipient industry. It also had me wondering just what had gone wrong. It happened that the site which had scraped my article belonged to someone who had recently asked me to do a guest blog post.

I hesitated for a minute, wondering if I’d somehow given permission for him to steal the content. Classic blame-the-victim mentality.

In the end, my editor contacted the offending party, who removed the content. I retweeted the real URL, and I sat, and fumed, downing more coffee, waiting for an apology that never came. I contacted some people with thicker skins and more years on the job than me and came away with some different perspectives. Then I posted my frustration on my blog, where I knew the scraper, my editors (and every other visitor, and maybe even some of you) would read it.

With content scraping, the question is not so much if it will happen to you, but when. Do something out of the ordinary, achieve a small amount of notoriety, or write something clever, then sit back and relax. Anyone, anywhere can lift your work and pass it off as their own, without so much as a credit, link, or thank you.

So what’s a creative, prolific person to do?

You could publish nothing, anywhere, keeping it all to yourself under lock and key. Ick. You can watermark photos, or use Flickr’s “all rights reserved” stamp (though this amounts to nothing more than a “pretty please don’t steal my photos, thanks”).

Writing is trickier. The written word is easily cut and pasted, or retyped from print onto a blog. South African infertility blogger Tertia Albertyn found several entries from a published book she’d written (So Close: Infertile and Addicted to Hope) posted on another blogger’s website.

Julie Schwietert, managing editor at Matador and one of the people who held my hand through my scraping experience, told me about a Cuban photographer friend of hers whose photo she’d seen in a gallery in New York.

He doesn’t follow up on these cases, he says, because the energy required exceeds the benefits he would reap. It’s not that he necessarily throws photo licenses to the wind, just that he knows that, realistically, he would make himself sick with the effort of trying to track all of these infringements down.

David Miller, Matador’s senior editor, has another take on artists’ rights, which he explained to me over Spanish tortilla one evening in Santiago. He believes Creative Commons licenses are the way to go.

CC defines itself as “a nonprofit corporation dedicated to making it easier for people to share and build upon the work of others, consistent with the rules of copyright.” CC has gained popularity via Flickr, where users can specify whether their works may be used with credit, for financial gain or not, and so on. Artists using CC have the benefit of increasing their internet footprint, with the possibility of remuneration coming via special projects. A good example is Trey Ratcliff, the most popular travel photographer on the web.

6 Thoughts on Content Scraping

1. Expect it. If you’ve got it out there, expect it to turn up somewhere else.

2. Prevent it. If it’s important to you to prevent it, take steps to do so. Hide it, watermark it, post it as an un-copyable PDF.

3. Find it. Go out and troll for likely thieves, search for uncommon character or word strings, or check your Flickr referrals to see where people are coming from. Often, someone has linked to your photo on Flickr rather than rehosting it, which makes the theft easy to track.

4. Defend it. If you’re irked, set your editors, your blog readers (like Tertia’s), and any other bloodhounds you have working on your behalf to storm the castle. Ask politely for the content to be removed. Grow steadily more insistent if they refuse or ignore you.

5. Accept it. Take a page from Julie’s photographer friend’s book, and realize that it’s more important to hone your craft than it is to chase down wannabes.

6. Do an end-run around it. By marking your work Creative Commons, you increase exposure. Consider that disseminating your work (even freely) does not cheapen your ability to express yourself, and if you develop your craft to the point where you have your own voice and vision, no one will believe that anything you create belongs to someone else.
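For the "Find it" step, here is a rough sketch of how searching for uncommon word strings could be done with a few lines of Python. It is only one possible approach, not a tool anyone in this article uses. The function names (`shingles`, `copied_fraction`, `search_phrases`), the eight-word phrase length, and the "longer phrases are rarer" heuristic are all assumptions made for illustration.

```python
import re

def shingles(text, size=8):
    """Return the set of overlapping word sequences ("shingles") in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {" ".join(words[i:i + size]) for i in range(len(words) - size + 1)}

def copied_fraction(original, suspect, size=8):
    """Fraction of the original's shingles that also appear in the suspect text."""
    mine = shingles(original, size)
    if not mine:
        return 0.0
    return len(mine & shingles(suspect, size)) / len(mine)

def search_phrases(original, size=8, count=3):
    """Pick a few phrases to paste, in quotes, into a search engine.

    Heuristic (an assumption): phrases with more characters tend to
    contain longer, less common words, so they make better queries.
    """
    ranked = sorted(shingles(original, size), key=lambda s: (-len(s), s))
    return ['"%s"' % s for s in ranked[:count]]
```

You would paste the output of `search_phrases` into a search engine, then run `copied_fraction` on your story and the text of any suspicious page that turns up. A value near 1.0 suggests the whole piece was lifted, while a value near 0.0 suggests a coincidence or a short quote.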

Personally, I’m working on moving towards step 6, but I must report with sadness that I’m still in the capitalist grabby mindset that what’s mine is mine, and it’s not yours to show, publish, make money from or claim as yours unless I give you permission. Let’s see how far that gets me.

Community Connection

Matadorians, where do you find yourselves? Has your content been scraped? Did you follow up? Are you ready to go Creative Commons all the way?

About The Author

Eileen Smith

Eileen Smith is the editor of Matador Abroad. She's an ex-Brooklynite who's made a life in Santiago, Chile. She's a fluent Spanish speaker who can be found biking, hiking, writing, photographing and/or seeking good coffee and nibbles at most hours of the day. She blogs here.

  • http://meganahill.wordpress.com Megan Hill

    How horrifying. This has never happened to me, luckily, but this sounds like great advice. The internet makes it easier to scrape material, but it also seems like it makes it easier to rectify those situations. Thanks for the tips!

  • http://www.roamingtales.com Caitlin @ Roaming Tales

    You lose a lot of rights with Creative Commons. Most of my content is All Rights Reserved. I know I can and do get scraped anyway but I can pick and choose when I want to do something about it, whereas with a CC licence I have to just accept it.

    When the content is hosted in the US, you can send a take-down notice under the Digital Millennium Copyright Act directly to the internet hosting company. You can send this yourself, you don’t need a lawyer to write the letter. The internet company should immediately remove the offending material or perhaps take the entire site off-line. I have had some success forcing content scrapers to remove my stuff simply by threatening to do this.

    But I don’t have time or energy to chase every infringement.

  • http://www.bearshapedsphere.blogspot.com eileen

    Yeah, agreed. First, shocking! second, grrrr. That’s basically the way I look at it. I am not at the point where I’m ready to share everything for free, but I have to believe that as I carve out my space in the world, my stuff will be mine because people will recognize it as such. Full of myself? Probably. Likely? Who knows.

    Thanks for commenting!

  • http://www.theroadforks.com Akila

    Oh, that’s awful! One of my very first worries about blogging was content scraping because I care so much about my creative property. I have not had it happen to me — but I have only been blogging for six months — and I am fairly assured that it is going to happen at some point especially because we post so many images on our site.

    One of my law school professors was one of the original creators of the Creative Commons concept and he pushed it incredibly hard, but I do not think it is the solution. First problem: most people don’t understand what Creative Commons means. Each author/publisher must specify the type of Creative Commons license he or she uses, and a user must follow that particular type of license (for example, some allow derivative works while others don’t). Second problem: Creative Commons assumes a public that shares responsibility and respect for others’ work product so that the right people receive the right sort of attribution. If these content scrapers are capturing fully copyrighted work, why would they take the time to check on the type of license? Third problem: Creative Commons dilutes the original author/artist’s work. This is a huge philosophical legal issue, but I have yet to see a thoroughly convincing article on the dilution problem and erosion of copyright law.

  • http://www.Travel-Writers-Exchange.com Travel-Writers-Exchange.com

    TWE has been scraped numerous times. It’s amazing when you receive a Google Alert and you recognize the words staring back at you. When you click on the link, it takes you to another website. Sometimes we receive a link back to TWE, sometimes we don’t. Sigh… What can you do? Hopefully, karma will pay a visit to these websites. You never know.

  • http://traveldroppings.com/articles/great-trans-ocean-travel.html traveling_mike

    Luckily my writing is not very good so anyone would be a fool to scrape my ass….

    See, told you it’s not that good.

  • http://www.birgirthor.com Birgir Thor

    I once found one of my photos in a photo contest. It did not win any prizes, so I did nothing more than contact the host of the contest and ask for the photo to be removed. It was some 16-year-old guy who had sent it in.

    But this is just what we get when we post on the internet. It is better to just get on with our lives and not let those people bother us too much :)

    But great article on the subject.

  • http://www.siliconbeachtraining.co.uk silicon beach training

    This happens to me all the time, it’s really annoying.

    My main worry is duplicate content and google penalties.
