I’ve been tweaking some of my websites lately, and noticed this website finally got it’s page rank back. Admittedly it’s only PR1, but that’s OK.
I’ve found a flaw this website inherited from the previous CMS (Nucleus CMS) which I believe has been affecting this website’s search engine ranking, and had a detrimental affect on other sites I’ve been linking to.
Nucleus CMS was an offshoot from Wordpress, but the branch in the development tree didn’t go far, and therefore I eventually switched over to a clean install of Wordpress.
Unfortunately my outdated and un-patched version of Nucleus CMS had a an exploit allowing spammers to inject spam message posts, and these were subsequently indexed in the Search Engines.
Even though the clean install of Wordpress has no such exploit, and those posts are removed, the “Hammy” URLs still exist in the SERPs because Wordpress basically allows you to throw it any URL.
The Hammy URLs look like domain.com/?STRING=blarg. I’ve removed the string name due to it’s hammy nature, but let’s just say it’s related to Pharmaceuticals, and is therefore objectionable.
I’ve seen this case before where a popular and well ranking website gradually loses it’s rankings because they have an un-moderated or outdated and exploitable forum, and all of a sudden bad content is injected, and the ranking gradually declines.
As the URLs vary, but have a common string being, I needed to create a filter or block on this URL using a wild-card.
I wracked my brains trying to come up with a .htaccess solution to this problem, and I’m sure such a solution is available, but ultimately, I think robots.txt is the way to go.
The Robots.txt entry I’m now using is:
User-agent: *
disallow: /*string*
I’ll be monitoring the SERPS over the next few weeks to see if these entries are removed, and see what sort of impact it has on my previously high ranking sites that this site is linking to.