Wellesley College Wellesley College
Search  
Excludes Library catalog; search options
Ribbon
 Home · About · Academics · Administration · Admission · Calendar/Events · Places · Computing/Library · News/Public Affairs
 Alumnae · Faculty/Staff · Parents/Families · Prospective Students · Students · Visitors

Designing for Search Engines

This information will prove to be a little frustrating, as it sometimes flies in the face of contemporary web design. It is, however, very much in keeping with the principles of usability.

While each search engine -- whether searching a local server or web-wide -- is different, there are many things you can do to enhance the likelihood of having your content found.

Make use of these techniques:

  • Have a good description of the goals/content of the page in the first paragraph. Many search engines will search only the first paragraph or the first xxx words of the page text.

  • Think about your audience -- both the ones who are insiders and understand the terminology to be used, and those who are unfamiliar with the names of specific offices at Wellesley or the language of your discipline. Find ways of speaking to your entire audience so that they can find the term they know via the search engine.

  • Avoid the excessive use of common words; unique terms will assist the searcher to reach your page faster. If a page has a lot of incidences of "education", "school" and "women", it will be harder to find on our local search engines since everyone here uses those words. People become frustrated when they can't find any unique terms to assist in narrowing down the search results.

  • Use clear navigational links throughout you site so that a spider which is crawling from link to link for indexing purposes will find things readily. Keeps these links up to date as you revise pages.

  • Use invisible "meta" tags to augment the language in your text

  • Whenever you include a graphic [whether a picture or a text graphic], describe it with an alt tag.

  • Be sure to use a descriptive and unique title tag for each page. This is the information seen at the very top of your browser window, which usually prints out along with the URL when you print a page. Most search engines post this title in the results and regard it as important when assigning relevancy for display purposes.

    Examples of bad titles:

    Untitled (this default title appears when you fail to set a title)
    socsci (an acronym with no meaning to anyone)
    page 1 (of what?)

    Examples of helpful titles:

    Political Science 842: Antarctic Base Politics
    Admission: Contacts
    Folk Song Club: How to Join

Avoid these practices:

  • Writing in frames. Most search engines cannot follow links for frames, thus the content is lost to indexing.

  • Scripting. [This one is really hard to swallow, so just be aware.] Scripts cause some search engine spiders to stop indexing. On our local SWISH-E search engine, the indexing is done, but sometimes the script displays where the initial text of the page would display in the results.

  • Setting text in graphics without providing alt tags. If, for design control, you create the entire page as a series of images, there will be nothing for a search engine to index, except for the required page dogtag.

  • Writing without unique terms.

  • Failing to create a page title using the <title> tag.

  • Forgetting to use alt tags to describe images.

  • Situating your content way down in a series of folders within folders. This URL is unlikely to enhance your chances of being found by a search engine:
    www.wellesley.edu/Folder1/Subfolder1/SubSubfolder1/file.html
    The deeper your content, the less likely that an external search engine's spider will find your information. The other down side to this practice is that your URL or address will be very long.

  • Tricking the search engine by repeating the same text over and over in meta tags or invisible text. Don't! Most external search engines can detect this tactic and may block your content from being indexed.

  • Forgetting to remove old pages on our server. A search engine may find them and may provide outdated information to unsuspecting search engine users.

What may not be indexed:

  • Frames sites and scripting in Javascript or Java applets are often not indexed.

  • Files made up solely of graphics or Flash animations

  • Most search engines will not index every word [other than stopwords] on your site. They may only record the first paragraph or the first hundred words, for example

  • Content which is locally restricted via passwording, a "local only" protection, or other means.

  • Most search engines cannot index the content of PDF files because they aren't HTML text. Google can index PDF files.

  • XML code may preclude indexing until the standard is more widely adopted.

  • Images, audio and video files

  • Text created as graphics [though if you assign an alt tag to it, that tag may be indexed.

Comprehensiveness of indexing:

  • Many search engines do not index every word or page.

  • Pages which are not linked to other pages will seldom be indexed. However, this is not a reason to fail to remove old pages. Old pages are found via old links, and search engines that fully index a server's content will also turn them up. Housekeeping is almost always a virtue...

A note about timeliness of indexing:

  • Our local SWISH-E search engine is fully reindexed early every Saturday morning.

  • The local Google search engine results from a spider which crawls the site approximately monthly, jumpinf rom link to link.

  • External search engines have spiders which crawl the world wide web on an average of every month or two.

Registering your site with search engines:

Registering with external search engines is worthwhile if you want to be found by folks searching quickly after you put the site up. This is important for new or revised web sites which have time-sensitive information.

While several commercial firms provide registration services, they typically suggest that you need to be registered at hundreds of search engines. You do not! You can register for free at the most important search engines yourself -- perhaps 10 in all -- to good effect. An opportunity for registration is often found on the search engine's homepage.

Similarly, send your URL to the webmasters of a few important pages linking to sites similar to yours, or to a few Internet directory sites such as Yahoo. For ideas, see important search engines/indexes/directories listed on the Library's Search the Web page.

We recommend that you do not pay a fee to a search engine resource for preferred placement in that same resource.

Any questions? For help in diagnosing your placement in search engine results or for assistance with automated link checking, contact webmaster@wellesley.edu.


 


Site Index · Contacts · Directories
Help

Course Catalog
Directories
Library
News Resources
Transportation
Weather
Wellesley & Region

 

Information Services webmaster@wellesley.edu
Page Created: May 30, 2001
Last Modified: August 13, 2005
Page Expires: June 30, 2008