Every now and then someone in the SEO community plucks up the courage to say something completely irresponsible and stupid about links. Usually it’s something like, “SEO is all about links.” Sometimes it’s more along the lines of, “Here is a great place to get links.”
Lately, however, people have been “testing” links to see if a 2nd link will pass value. That is, if you embed two links to the same destination on a page and each uses different anchor text, will the second anchor text be passed to the destination as well as the first? Some people are claiming you won’t see the second link pass value.
To which I say: Oh, really. I guess all those 2nd links I’ve been getting value from don’t count. Of course, how silly of me.
SEOs in general conduct poor quality tests. I mean, I’ve written about how SEO tests usually don’t work because of basic SEO incompetence and how SEOs do their tests wrong.
SEOs love to test nofollow links and make extravagant false claims and SEOs can’t test nofollow links.
Every good SEO should understand that link analysis can be misled by many factors. That is, link analysis sometimes sucks.
There are fundamental concepts that should govern all attempts at link analysis and proven concepts about link analysis that help you understand which links should work and why they should work as well as why they sometimes don’t work as expected.
But you can boil it all down to a few points:
- SEOs don’t know how to isolate factors in their tests (you pretty much have to do it by statistical analysis)
- SEOs don’t provide adequate allowance for other factors in their test results
- SEOs don’t include temporal factors in their testing
- When it comes to testing, most SEOs don’t have a clue
A bad SEO test consists of placing a link on a page, or setting up a few pages that are “optimized” for some unique expression. The worst tests, in fact, are usually conducted on nonsense expressions with little to no content. Without an environment that replicates natural search results, your testing won’t show you anything useful for search engine optimization analysis.
For example, if you want to know whether an H1 header helps improve your relevance score, creating five pages that each emphasize relevance in a different way invalidates your test. If you decide, “Well, I can jumble up the text and just put an H1 on a page” that may seem to make your test more valid but you’re not considering the way search engines evaluate proximity and word order.
The most reliable test model for evaluating on-page factors is to position pages in natural search results. You need to use a test set of at least 5 to 10 pages (the more the better, but larger test sets require more time and effort). Move the pages as high into the search results without the on-page factors you want to test as you reasonably can.
Then walk away for three months. Give the search results time to stabilize. If your pages drop in ranking you either have to make adjustment or just wait until they hit whatever ranking they’ll reasonably, naturally settle to. You should see a page hit its natural ranking somewhere in the 3 to 6 month timeframe.
When you’ve recorded 4 weeks of approximately consistent search ranking (no week varies from the previous or next by more than 2 positions), you’re ready to conduct your test. Now add you the on-page factor you want to test to your test pages.
Wait 1 month. If you cannot see the page’s new on-page content in the search engine cache, your test may be pointless for lack of crawling/indexing. In Google’s Supplemental Results Index, for example, pages may be crawled and indexed quickly and not recrawled for a very long time (although Google claims to have been working on this lag time, there are other problems with Supplemental Index pages).
If you see a change in results within 4 weeks, you need to remove your on-page factor from your test pages and wait 1 more month. If you see a change in results within 4 weeks, you need to add the on-page factor back to the test pages. Wait another 4 weeks. If you see the same change in results as the first time you add the on-page factor, it’s now safe to reasonably conclude that the on-page factor influences search results.
Real SEO testing takes time and resources and most SEOs don’t invest either time or resources in the testing process. You can engage in parlor trickery and SEOfoolery. A lot of people do that, usually with their linking tests.
Simply embedding a link on a page doesn’t tell you anything useful, even if you see the link appear to pass anchor text. Why? Because you allow other factors to pollute your test environment. Let’s look at possible linking relationships. NOTE: I call a page trusted if it is capable of both passing and receiving value. I call a page trustworthy if it’s unindexed (probably new) but on a site that has a lot of indexed/trusted pages (more than 50%). Just because I say a page is untrusted does not mean you should assume it won’t pass or receive value.
- You can link from an indexed trusted page to an indexed trusted page
- You can link from an indexed trusted page to an UNindexed trustworthy page
- You can link from an indexed trusted page to an indexed untrusted page
- You can link from an indexed trusted page to an UNindexed untrustworthy page
- You can link from an UNindexed trustworthy page to an indexed trusted page
- You can link from an UNindexed trustworthy page to an indexed untrusted page
- You can link from an UNindexed trustworthy page to an UNindexed trustworthy page
- You can link from an UNindexed trustworthy page to an UNindexed untrustworthy page
- You can link from an indexed untrusted page to an indexed trusted page
- You can link from an indexed untrusted page to an indexed untrusted page
- You can link from an indexed untrusted page to an UNindexed trusted page
- You can link from an indexed untrusted page to an UNindexed untrusted page
- You can link from an UNindexed untrusted page to an indexed trusted page
- You can link from an UNindexed untrusted page to an indexed untrusted page
- You can link from an UNindexed untrusted page to an UNindexed trusted page
- You can link from an UNindexed untrusted page to an UNindexed untrusted page
You don’t want to test all these different conditions at once (although if you want to try, you’ll need — in my opinion — at least 80 test pages but you cannot be sure what’s trusted and untrusted in your indexed pages).
You should never use a page to test more than one hypothesis at a time. That is, don’t perform two tests on a page at the same time.
Nurturing test pages is a bit like grooming a line of laboratory test rats over many generations. You have to know the pedigree of your test subjects. If you don’t know that then your conclusions are invalid because you don’t have any way of knowing why a test produces a certain result.
You have to be certain that you’re changing a stable page. You have to have a firm idea (based on solid evidence) of whether the pages you’re linking to can receive value, are stable, and don’t have other factors contributing to their possible relevance for a particular expression.
Just dropping links on pages at random tells you nothing. If you look at 10 different queries and use links to affect the results in 6 or more of of those queries (at least twice — remember the do-undo-redo component is vital for SEO testing), you have a statistical basis for concluding that your test results are predictable. In SEO testing it is absolutely mandatory that you statistically isolate the factor you’re testing in order for your test to be valid.
When you conclude that a cause-and-effect relationship exists, the established predicatability doesn’t mean you’ll always see the same results in natural search behavior. After all, in natural search behavior you’re looking at many different factors coming together at once. The search engines and the searchers have to figure out how they want the results to look. All you can do is put enough factors together (both on-page and off-page) and hope that you’ve met the challenge with sufficient resources.
Anyone can slap a link on a page and point to apparent results and say, “See! This proves my theory!” But they’re not testing or proving theories. They’re just practicing SEO parlor tricks, which serve no useful purpose (except maybe to attract links from gullible bloggers).
And that’s not all. But I’ll leave the rest of the discussion for another time. I’m sure there will be plenty of opportunities to comment again on faux SEO link tests in the future.
{ 6 comments… read them below or add one }
OliverTaco 06.06.08 at 11:20 am
You know, I wish you wrote more - I really enjoy your articles. Hmmm, causality?
My wife is a scholar who works with very large (30K to 300K) data sets about people’s work behavior, so while I am no statistician, I have watched a world class scholar work on large data sets. And what really strikes me is that without being able to construct a “why” around the behavior it’s pretty easy to get stuff wrong.
So, with humans, you can say, “Well, lower income people with lower education tend to start lower growth lower capital business for X, Y, and Z reasons.” And it makes sense.
With google moving pages round due to an algy with 200+ tweakable features, well, good luck with that!
-OT
sasa 06.06.08 at 12:32 pm
Yeah, if you take into account that really good tests take a really long time and that Google makes like a couple of hundred changes to the algo a year, no test - no matter how thorough - will in the end be of any use by one month after it has been conducted. Conclusion: SEO testing is virtually useless or impossible unless you probably spend like a million bucks a year on R&D.
But if you tell & show ppl that you do all these little tests and give the illusion that what they represent is the truth and nothing but, you *are* doing the better marketing. And we all know who gets all the business. It is always the better marketeer, not the one ranting like a mad man although he in the end might even be right or better or both
DangerMouse 06.06.08 at 2:19 pm
What statistical analysis techniques would you suggest to isolate factors?
Michael Martinez 06.06.08 at 7:43 pm
It doesn’t matter how complex Google’s algorithms are or how often they change the search engines. You can still observe trends in search results behavior and those trends can easily be documented over time. All it takes is patience.
Think of how complex the Earth’s ecosystem is. It’s much larger, much more active, much more complex than a search engine. Nonetheless, we tag animals, watch plants grow, send planes into storm clouds, and capture large amounts of data for statistical analysis in thousands of ways.
With search engine results analysis the SEO community just needs to expand the scope of its investigative exercises a little bit. Instead of tracking one page with an obscure query people need to track 10 pages in active, natural queries.
The scientific method works just as well in documenting search engine behaviors as in documenting the behaviors of schools of fish, flocks of birds, herds of cattle, clans of meerkats, etc. You begin with observation and collect as much data as possible.
Unfortunately, the SEO community prefers to collect as little data as possible and to draw as definitive a conclusion as possible. That never works except by random chance, in which case you might as well draw explanations out of a hat. You’ll have as much accuracy with that method as with most SEO tests.
Michael Martinez 06.06.08 at 7:46 pm
DangerMouse, the 10-page test method isolates the factors for you (crudely but adequately enough). By performing the test in 10 different queries on 10 different sites, if you achieve similar results most of the time you can be reasonably sure that whatever factor you’re testing is the primary cause of the changed results.
Most people don’t have the time or resources to perform a more precise test than that.
bwelford 06.07.08 at 8:47 am
This is an excellent cautionary message, Michael. However there is an additional factor that makes testing even more difficult.
As a mathematical statistician, I would counsel that it’s always worth making sure you understand all the variability in the field of study before designing your tests. The best analogy I have for the results of any given keyword SERP is a turbulent river. OK Google may be tuning its algorithm on a daily basis. Unfortunately the world is not standing still. Others may be creating web pages or may modify existing web pages so that even with a static algorithm rankings would change.
If you are aware of all these random factors, then you can determine what you might learn from any given test. Your assumption should be that there is lots of random noise in the system. So you should do adequate repetitions of any test and look at the average behavior.
Given all this, I believe it is almost impossible to reverse engineer the way Google does keyword searches. Thankfully they are trying to produce relevant pages for human beings. So perhaps your judgment of what is relevant to humans may be as important as whatever you ‘prove’ from your tests.
You must log in to post a comment.