Plain Old Latent Semantic Indexing

Latent Semantic Indexing. Cool sounding term. Now, before I get into what it actually is…can you imagine how smart you’ll sound when you use that term with your friends, as they discuss web technologies? But I digress…

Here’s the definition of LSI use from SEOBook.com (taken from http://www.seobook.com/lsi/lsa_definition.htm):

Latent semantic indexing adds an important step to the document indexing process. In addition to recording which keywords a document contains, the method examines the document collection as a whole, to see which other documents contain some of those same words. LSI considers documents that have many words in common to be semantically close, and ones with few words in common to be semantically distant. This simple method correlates surprisingly well with how a human being, looking at content, might classify a document collection. Although the LSI algorithm doesn’t understand anything about what the words mean, the patterns it notices can make it seem astonishingly intelligent.

So how does this effect you and your Plain Old Semantic HTML? Well, for starters it will give you a better grasp of your sites overall theme, and how closely matched the context is in correlation to each other. Since Google has already been using this type of technology in it’s algorithms, and it looks like the other SE’s are following suit, I urge you to really analyze your keywords within content for their semantical relationships and values.

Popularity: 18% [?]

[Slashdot] [Digg] [Reddit] [del.icio.us] [Facebook] [Technorati] [Google] [StumbleUpon]

About the Author

Martin

Member of the GAWDs and Web Standards Group. As a member of the Web Standards Group, as well as membership in the Guild of Accessible Web Designers, I strive to achieve maximum website accessibility in my designs, while using the latest, cutting-edge eXtensible HyperText Mark-up Language/CSS markup. My sites fully validate with the W3C standards for XHTML/CSS, as well as comply with the W3C's WAI - A, AA, or AAA standards. Separation of Content from presentation and a site that fully complies with this philosophy - will reach more than just your intended audience, it'll reach everyone! Web standards at FierceStreet Networks is all about usability - XHTML for content, and CSS for presentation.

Leave a Reply

Please copy the string M419YF to the field below:

You can use these XHTML tags: <a href="" title=""> <abbr title=""> <acronym title=""> <blockquote cite=""> <code> <em> <strong>