Google’s desperate gamble

Posted by Michael Martinez on August 6, 2007 in Web spam

Although it’s still a little too soon to suggest that Google’s downward spiral has finally begun, Google has lost search market share two months in a row. As near as I can determine (based on largely unreliable third-party estimates), Google began losing market share in late May.

Late May. That would be about the time Google rolled out Google Universal Search, which they said would take a little time to be felt in their search results. That would also be about the time that Microsoft and Ask launched new offensives against Google’s market position.

But Ask’s campaign has fizzled, if not actually backfired, and people are overlooking the fact that Compete says Microsoft’s ClubLive was not responsible for their increased search market share. And in my own ongoing analysis of search referrals compared to 2006, I have noticed a significant decrease in the growth of referrals from Google since May.

My personal network’s performance is hardly a bellwether for the Web, especially since I’ll leave my network to its own resources for long periods of time. But I have been actively building out content, redesigning pages, and reoptimizing portions of my network for several months. I’m achieving my targets with respect to Google rankings but the growth in referrals has been cut in half.

But Google also missed an earnings estimate despite increased sales of online advertising. Wall Street may not know anything about search technology, but they can read income and balance statements. On the other hand, Google has been plagued this year by high profile resignations, advertisor pullouts, and lawsuits or government demands for private data disclosure, changes in procedures, enforcement of copyrights, and more.

Everyone wants a piece of Google right now and the users don’t all appear to be happy enough to keep Google in the driver’s seat. When it comes to Web search this is still Google’s game to lose but they have made some spectacular technological blunders over the past year, including:

  1. Dumping millions, possibly billions of unique content pages into the Supplemental Results Index with no means of achieving search visibility
  2. Indexing thousands of students’ social security numbers from more than one district
  3. Granting Wikipedia unprecedented dominance in search results
  4. Pretending that Supplemental Results don’t exist and won’t be treated unfairly in search results
  5. Forcing logged in users to rely upon personalized search and search history
  6. And someone seems to have spammed Google Maps

Google has done some technologically good things this year, of course. They’ve expanded services through Webmaster Central (but that doesn’t help the search experience); they’ve expanded their custom search technology (but that doesn’t affect their Web search interface); and they’ve joined the growing list of search engines to offer integrated search.

But is integrated search all that it’s cracked up to be? I can no longer find any of the news stories about it but an Indiana school district filed for an injunction against Google’s Universal Search rollout within a matter of days after the May 16 Searchology press conference (where Google announced its Universal Search Solution). Was there a connection between that injunction and what appeared to be a rollback in Universal Search behavior (which, in theory, would have allowed Google time to make some technological tweaks)?

Google is juggling lawsuits over copyright infringement, trademark dispiputes, and privacy issues. They have consistently lost legal battles in the European market. They have had to retrench on YouTube content because of actions taken by industry service groups such as RIAA and some foreign governments.

Google has been fighting a losing battle against search engine spam for several years. The quality control measures have, on more than one occasion, forced millions of non-spamming sites to lose visibility. Google usually responds to the consequential screaming by loosening the ropes with which it has tightly bound its data, but they seem to be determined to enforce Web apartheid more than ever.

Google introduced an early form of Web Apartheid with its Supplemental Index in 2003. But they showed that trust is the foundation of Web Apartheid with their unintentional Sandbox Effect in early 2004. By late 2005 many people who had been sandboxed the previous year understood Google’s trust factors well enough to understand they needed more links: the resulting link glut forced Google to change its strategy.

The Bigdaddy update of December 2005 - March 2006 changed the rules by implementing full Web Apartheid. Only those sites that had an arbitrarily minimal amount of internal PageRank were permitted into the Main Web Index. Every other page was sent to the Supplemental Results Index. As Google recrawled the Web throughout the rest of the year, more pages fell into the Supplemental Results Index for lack of sufficient PageRank.

After Google stopped displaying query data in its cache for Supplemental Results Pages, I compared several test queries with early 2006 results, concluding that Google might have demoted up to 80% of its Web index to the Supplemental Results.

My guesswork may not be flawless, but the numbers of unsearched pages were alarming when compared to the immense numbers of complaints voiced in blogs and forums about sites losing traffic referrals from Google from mid-November 2006 onward. It seems obvious that Google had taken a drastic step in its fight against spam (ignoring the fact that a simple study claims 75% of Google’s Blogspot consists of spam).

But how does demoting a majority of the Web to virtual unindexed status fight spam when the spammers are savvy enough to build out link networks that help the spam pages enter the Main Web Index? The real problem here is that Google is fighting two battles and losing both of them.

The first battle is the massive increase in Web spam that Google itself has fueled through its immensely successful AdSense program. It’s easier than ever for people to earn at least a few dollars a month off of nonsense text compiled from search engine alerts that are auto-posted to blogs. The more money spammers can earn from advertising, the more spam Google will have to deal with.

The second battle is the massive increase in artificial linkage that Google has encouraged by misleading people about the importance of PageRank. Spammers and many SEOs value links chiefly for the anchor text they can use to spam pages to the top of search results, but Google has chosen to ignore the fact that people are riding the anchor text wave.

Now Google has made it almost mandatory for pages to get at least some internal PageRank so that they can rank in search results, thus forcing Webmasters to seek links for precisely the one reason Google says it doesn’t want people to seek links: to acquire PageRank. While this mis-shapen strategy may dilute the amount of targeted anchor text being created in link networks, it will only increase the intensity of linking because people will now act with less constraint.

Whereas yesterday’s SEO advice was to acuire only “relevant links”, now people will be seeking “PageRank links” because that is what they need to get their pages out of the Supplemental Index.

Good move, Google. Maybe you should just hide the “Supplemental Results” label and pretend there is no Web Apartheid, for surely instead of dealing with the problem (your allowing links to pass anchor text) you’ve only made the problem worse not only for yourself but for everyone else, too.

For the first time since 1998, pages now actually need substantial PageRank in order to rank in search results — which all but guarantees that large content sites like Wikipedia will dominate search results despite the fact that they are generally unreliable sources of information.

And that may be why people are starting to turn to other search services. Or it may only be a summertime fluctuation and Google will spring back in July’s search market share reports.

Like I said, it’s too early to be sure of anything, but I needed to write about something and wanted to set the stage for a few “I told you so” type posts down the road if Google continues to practice Web Apartheid.

As a searcher I am tired of not being able to find content that I know is out there.

9 Comments on Google’s desperate gamble

By Gids on August 6, 2007 at 8:04 am

Hi Michael
Thanks for another very interesting post.
Because Google has been so successful people tend to forget that, just like any other company, when it looses key personnel; slightly less experienced people take the positions and the switch can have profound effects.
I think many of us suspect that behind the (enforced) happy Google façade there might be trouble brewing…
PS When will Google (and Wikipedians themselves) learn that Wikipedia isn’t the font of all knowledge?

By tinkerbellchime on August 6, 2007 at 7:09 pm

I’m going to upset you here, Michael. I use Wikipedia 10+ times a day. I’m a big fan. Couldn’t live without them!

Why? Partly because of the nature of my site, which needs to be very simple with just a tad of general information due to its core audience, but also because with Wikipedia I don’t have to log in all the time with passwords. Encarta would be a better choice, but I can’t be bothered with logging in over and over again. I try to keep everything open at once, but you know how that goes. Somehow I manage to accidently close things.

Why else? Wikipedia is the best choice for searches more times than not. This is the fault of web developers who won’t provide enough good information on THEIR sites. If I were Google, I would keep Wikipedia on top of the front page in most cases, too. Yes, it’s simple, watered down information, but I’m learning, like many have, that I just can’t read long, deep articles on everything because there just isn’t enough time in the day. Plus, a quick overview is a great starting point before digging deeper.

Don’t be a Wikipedia hater ;-) Today there is a nice short article on the European Parliament on their front page. Reading their daily front page features keeps me informed, much like a reading a daily newspaper does. I know the facts won’t be 100% right, but I only want a nice overview on something new for the day.

On a similar note, I’ve found that I have to limit the number of SEO blogs that I read…also due to time constraints, and your blog is now my ONLY ‘must read daily’ SEO blog.

By Michael Martinez on August 6, 2007 at 7:47 pm

I’m not into hate. I just refuse to use or endorse a resource as unstable and unreliable as Wikipedia.

Not only do many teachers and libraries now forbid students to use Wikipedia, it’s just not a reliable source of information for many topics where the only contributors are well-meaning but poorly informed people.

As a Wikipedia contributor who has repeatedly watched good information be replaced by propaganda and nonsense, and as someone who has reveiwed numerous comments by former Wikipedia admins who were frustrated with the lack of standards there, I just know better than to trust Wikipedia’s “information”.

Convenience is just not a plausible justification for promoting Wikipedia to the top of search results. Nor would I ever want to link to any article at Wikipedia, since I would never know how many people might be led to wrong information (which can stay in place on Wikipedia for months).

By tinkerbellchime on August 6, 2007 at 9:05 pm

You do have a good point about convenience not being a plausible justification for promoting Wikipedia to the top of the search results. My daily use of their services is helping them stay on top. If Encarta didn’t make me sign in all the time, I’d be using them instead. So, why can’t Encarta make their product easier to use and less of a bother? Well–I promise to try to keep both sites open for now on, so I can spot check information to see if they both agree on the same facts.

You’re right that teachers are asking students to not cite Wikipedia as a source. My take is that it’s okay for grade schoolers to use it, but not for college students. High schoolers? I’m not sure. I’ll bring it up in a few educational fourms and see what kind of replies come in. (I think I know someone who would put such a survey on his blog. I’ll let you know.)

I just read one of your old comments where you said that you wrote a Wikipedia article on SEO and that you were ranking number one for SEO theory. You’ve got at least four listings on Google’s front page for the term “SEO Theory,” so keep up the good work. I think that interest in the subject will increase because most of the SEO blogs keep going over the same material. With the learning curve increasing, people eventually start looking around for something meatier. I’m tired of “great post” comments and being told to use headings. So, great post, Michael :~) !

By the way, you also said that you could beat Wikipedia in rank for a term if you wanted to. I believe that you could and so could many other people, but they don’t write enough content to actually do it because it’s time consuming.

If you had only ONE book to recommend to someone beginning SEO, what would it be? Don’t give me a top ten list…only ONE. Later, I’ll ask for more.

By Michael Martinez on August 6, 2007 at 10:52 pm

Only one book? The SEO Theory Blog.

By tinkerbellchime on August 7, 2007 at 3:14 am

Very funny, but o-tay.

By bestoptimized on August 7, 2007 at 2:12 pm

I don’t know if you have read The Long Tail but that book Chris Anderson cited a study that came to the conclusion that Wikipedia articles have only slightly more errors than Britannica articles (I have not read the original study). I like Wikipedia. I think for the most part it is pretty accurate but because of their open system there are errors. I sometimes lookup keyword phrase wikipedia to see what their article says. I do agree that they have too much prominence in the search results. It is stupid to put them on the top for commercial results and some noncommercial results where there are many better sources.

By Halfdeck on August 7, 2007 at 2:22 pm

Wikipedia annoys SEOs because its in-your-face proof that you don’t need to build links, buy links, or do any kind of cheezy marketing (like submitting a Wikipedia page on Digg) to rank high for gazillion keywords. All you need to be is sanjaya on the web (can’t sing all that great but get enough votes to rank in the top 10). Yeah its a popularity contest, and millions of websites “voted” for Wikipedia dispite its shortcomings. Why?

Because when you’re looking for information, content on Wikipedia is better than 99.9% other websites out there.

Considering the information you find on Wikipedia isn’t necessarily all that accurate, that’s a sad fact of life - that information you find elsewhere is just as misleading if not more. On top of that, the intention of the information on other websites isn’t necessarily to inform you but to trick you into clicking on ads.

Now that’s not even about Google Trust - it’s about people’s trust - to the extent that people trust George Bush’s honesty even though Bush may not know what he’s talking about.

By Michael Martinez on August 7, 2007 at 3:33 pm

bestoptimized, the study you refer to only looked at scientific articles, and the most likely contributors to Wikipedia’s scientific articles will be scientists, engineers, and teachers. The non-scientific sections of Wikipedia, particularly those dealing with literature and biographical entries, are rife with “original research” (which is forbidden by Wikipedia), factual errors, biased comments intended to influence people to see the topics in a certain way, and other egregious problems.

Wikipedia is in no way “accurate for the most part”, but even if the issue of accuracy could some how be addressed, the fact that Wikipedia content can be changed by anyone at any time makes it an unworthy reference. Your link today to an article you like could tomorrow promote something you totally disagree with.

Comment

Log in or Register to post a comment.

More

Read more posts by Michael Martinez

About the Author

Michael Martinez is the Director of Search Strategies for 1st Query, an Internet Marketing firm offering organic SEO and PPC services.

The measured nonsense of SEO relevance Diary of a Mad SEO Gooroo