Files and Directories | Search Engines and Links
The underlying file-structure on the server may be largely invisible to the end-user, but it is still worth thinking carefully about. First of all, it's not completely invisible to the user: URLs are visible. It is important to keep URLs short so people can cut and paste them into and from mail easily. Where possible file and directory names should be words - search engines look at these and people can remember them. Keep names in lowercase - also avoid using spaces, apostrophes, or other characters that can confuse humans or software, and don't mix dashes and underscores. Encode as much information as possible into the file structure - give files and directories sensible and informative names. While it is also possible to put metadata into the file itself, it's not guaranteed that you (or others working on the site) will remember to do this, and such metadata can be hard to view. In contrast, a file has to have a name and be in some nested set of directories, and its location and name can be viewed using ordinary file-system tools. Make use of the hierarchical file-system. Even if it appears silly to create a new directory just to put one "index.html" file into, this allows room for future expansion. It is also important to have files named "index.html" (or whatever your server defaults to when a directory is requested) so "URL chopping" will work. |
Specifics:
Examples:
|
Links to your pages from other sites are valuable. Not only will they bring visitors, but they will contribute to your site's ranking in citation measures like Google PageRank. So you should not break links by changing file names or removing old files. Every page should have a TITLE metadata field that is descriptive without any context. TITLEs are used in search engine result lists, in bookmarks, and by automated link-creation software. So people will often view page titles in isolation from everything else on the page or site. Every page must have its own URL. Implementations of frames where the location doesn't change as one changes pages are inherently broken, as they make it difficult or impossible to bookmark or link to specific pages. Avoid having multiple addresses for the same page. With search engines doing more sophisticated ranking, you risk having your page appear twice in rankings - at positions 30 and 50 perhaps - instead of once - in the top 10. |
Specifics:
|