Since creating Worth1000, (a terrible example of SEO), I have learned a lot about what works best in getting search engines to crawl and understand what your page is about. I have since created Plime, which is a great example of an optimized page. In this post, I’ll outline the theory I used in laying out my site to improve it’s readability by search engines.
This article is not about ranking higher in search enginges. That’s the next step. It’s about structuring your content in such a way that search engines will understand it and index it better.
Before I begin let me just explain the logic here. Think of a search engine as a stupid computer program, and understand that the program is programmed to read the data on your website in the form of a semantic hierarchy. The html on your site is really designed for people to read, so the trick is to also design it so that search engines can read it as well, in a way that doesn’t interfere with what people see. A well designed SEO page will not effect your page design AT ALL.
By semantic hierarchy I mean the program is looking for the following items, in this order to determine a page’s content:
- Domain name
- Page’s Title
- Page’s URL structure (excluding domain)
- Page’s Headline
- Page’s Subheadline
- Page’s Sub-subheadline
- Page’s content
- Bolded and italicized words within the content
- Links within your page, linking to internal pages (breadcrumb trails,left navigation and right article list)
- Links within your page, linking to external websites
I’ll go through each section with some tips on how to improve it and explain the logic for search engines and people, where the change will effect what people see.
Domain name
If someone owns “plime-trees-are-delicious.com” and you start an article called “plime trees are delicious” they might get higher in search engine results for “plime trees are delicious” if anyone links to them, because domain names are the most important element in most search engines (since they are not editable, unlike html).
Page’s Title
Always include your site’s name (like Worth1000.com) and the actual headline for the page you are at. (Worth1000.com | Out of Bounds Contest) in your title tags. Search engines give this a lot of emphasis.
Page’s URL structure
If all of your pages have dynamic extentions (like .asp or .php) and use querystrings to point you to different dynamic articles you’re making a mistake.
An example of a bad URL would be: http://www.worth1000.com/contest.asp?contest_id=1163
This is bad for two easons:
- PageRank effects articles at the document level, which means querystrings are excluded (so you won’t develop separate pageranks for different articles… all will share the same rank).
- Some search engines and stat logs cut off the querystring whenindexing your content. Having a link in someone’s stat logs or in a non-Google engine that point to “http://www.worth1000.com/contests.asp” is useless and is a wasted opportunity to get more visitors and pagerank.
Structure your URLS like this instead:
http://www.worth1000.com/contests/photography/1163-Title-of-the-page.html
All that’s important in there is the number 1163, but you’re already getting some important extra keywords in there that will help with SEO.
I don’t even need to change my code. I can simply use a program called modrewrite which will redirect that URL to my actual page: http://www.worth1000.com/contests.asp?contest_id=1163 (the rest of the stuff is ignored by modrewrite, so I can use anything there). Search
engines will see it as a static page, even though it\’s exactly the same as it is now.
This also has the added bonus of users who link to this URL in forums that automatically parse URLS will now have some keywords as a part of the URL itself. This kind of clean url will definitely not annoy users either.
Page’s Heading and Subheadings
Don’t use fonts and CSS to make a headline appear big. Wrap it in <H1> tags instead.
A note to SEO folks: This tag is not as deprecated as you might think. <h1> is still semantically telling the search engine “this is the most important headline on this page” <h2> are less important for subheadlines, but if you have subheadlines n an article, using them can only help the spider understand how your content is structured and that is a good thing.
So long as you style the <h1> in the CSS the same way you have been making larger headlines in the past, users won’t visually see a difference at all.
Best of all about this tip is that very few sites make use of it.
Page’s content
This is your keyword text. Just make sure you have alt and title tags for all images, so search engines can read them. I’d
use <h3> for all captions under the images, since images (and the captions that explain them are usually important to the article. Search engines place minor emphasis on words that are put in between bold and italic tags in your content. Very minor emphasis, but just worth noting that if an article is about a certain celebrity for instance, and there name appears multiple times, bold each appearance of the same and it will help the spider think that celebrity is the focus of
the page. Don’t keyword stock though. Search engines will notice if the same words are appearing too often (called keyword density) and it will annoy your users in any case, so don’t try to stuff your page with redundant keywords. Always write your articles for your users.
Links within your page
You need to nake your navigation load before the page’s content so that spiders can access the rest of your site. You can use CSS to make this not effect your positioning of the navigation. This will help your links get indexed better. When a spider visits your site it sees the page as one huge block of html code, no matter how it’s laid out, so where in this block certain elements are placed is very important. Remember that using CSS you can have an element display visually to users first on the page, even if it’s the last thing to load in the html text block for spiders. If those links are all at the bottom of the text it interprets it as not important, and in some cases may not even download enough of the page to see them (some search engines only download the first 50,000 bytes of html).
Link to your sitemap prominently on your homepage if you have one. If you don’t then make one. This is really for the search engines more than the people, but it should even be helpful for those people who do see
it.
All links in your page should include title tags. <a href=”http://www.mysite.com/blog/” title=”blog”>blog</a>.
Links outside of your page
When linking to any content that isn’t related to your site (let’s say ou’re plugging a blog article about tech you enjoyed, but your site is about catfood), always add “rel=nofollow” to the hyperlink or else Google follows it and gives some of your outgoing page rank to them and also may assume that your sites are similar (since the logic is that sites only link to other
sites that will be interesting to their visitors).
Here is what I mean: <a href=”http://www.blog.com/whatever/” target=”_blank” rel=”nofollow” title=”great read”>great read </a>.
If the site is something you think is related or your want to pass pagerank leave rel=nofollow out.
[…] Worth1000 Words – Good SEO Practice A blog entry that discusses the best way to set up your website structure to make it easier for spiders to crawl, the White Hat way. […]
seoline
…wrote a very informative article on…
ONLINE – DRUGSTORE!
PRICES of ALL MEDICINES!
FIND THAT NECESSARY…
VIAGRA, CIALIS, PHENTERMINE, SOMA… and other pills!
Welcome please: pills-prices.blogspot.com
NEW INFORMATION ABOUT PAYDAY LOANS!
Welcome please: payday-d-loans.blogspot.com
GOOD LUCK!
Searching for the Answer
Searching for the Answer
One thing you did not point out was that pages should use words in the URL and separate them with HYPENS, not underscores. When Google parses URLs the underscore makes it look like one word. Use hypens.
hyphens…. sorry about the spelling.
this was an extremely good post, but I have a question:
how did you find out that this was the actual way that the engines crawled sites?
I mean, not that I doubt it – it makes perfect sense; i merely wanna know if there is an official site or something – or if you’ve actually found something that stated that this was what the engines truly did.
Thanks for a great post!
Bryan
Nice tip! (Links outside of your page)
Something you maybe able to help with as i cant seemed to find the answer?
Is there good reasons to not use upper case characters in a url. ? IE: What harm will it do?
Thanks
Chris.
good work, will back soon, great site congratulation!!
Basic information but in very logical way… i got some info here.
This is great information, thank you so much.
What a well thought out and presented post! I wish I’d writen it. Thanks.
If you ever want to read a reader’s feedback 🙂 , I rate this post for four from five. Detailed info, but I have to go to that damn msn to find the missed parts. Thank you, anyway!
I rate this post as a 10 outa 10 fool. Whoever says otherwise dont know whats up. I’ve learned alot about seo, thanks.
staff,
makingcolor
Learn more at Web Designers In Houston
[…] Worth a read. & this one. . __________________ Regards Chris, Sites are view in FF & IE, with res @ 1024 x 768 on […]
Actually the domain name is not the most important part. In fact, you can get to the top of Google without having the keyword in the domain name. It’s done all the time. I have client domain names that are in the top of Google and don’t have keywords in their name. So, I disagree with that statement.
Thanks for the great info. If everyone will read this before trying to do their own SEO work, then less mistakes will be made.
[…] się rozpisywał, ponieważ wiele już zostało na ten temat napisane ->podstawy-optymalizacji, good-seo-practice, […]
Very interessting SEO-blog.
Enjoyable read. will try to come back to this blog in the
future.
This is the right site for anyone who really wants to understand this
topic. You know a whole lot its almost tough to argue
with you (not that I really would want to…HaHa). You definitely put a fresh spin onn a subjct that’s been written about for
years. Excellent stuff, just wonderful!
Découvrez notre solution pour vos envois b2b
de SMS. Utile pour les particuliers, notre service vous permet depuis votre poste de travail de programmer vos SMS à l’attention
de tous vos prospects. Réalisée pour envoyer un très grand nombre de SMS, notre API SMS
est très rapide pour effectuer vos envois en masse.
Doté d’une puissante interface de gestion, notre solution est le top des solutions pour l’envoi de SMS et de
gestion des campagnes.
Vous élaborez créez vite fait vos messages à
vers des centaines de cliens.
Oh my goodness! Impressive article dude! Many thanks, However I am experiencing difficulties
with your RSS. I don’t know the reason why I am unable to join it.
Is there anybody getting similar RSS issues?
Anybody who knows the answer will you kindly respond? Thanks!!
Hello, Neat post. There is a problem with your site
in internet explorer, would check this… IE still is
the market leader and a large part of people will leave out your wonderful writing because of this problem.
Hi it’s me, I am also visiting this web page regularly, this web site
is genuinely pleasant and the visitors are genuinely sharing good thoughts.
Pretty! This has been a really wonderful article.
Thanks for providing these details.