Soundex: Difference between revisions

100 bytes removed ,  5 January 2024
=== How Soundex Works  ===


Soundex classifies the consonants of the alphabet into six groups of sound-alike key letters. For example, in many languages the '''B''' and '''V''' sounds are nearly interchangeable, as are '''B''' and '''P''', and '''V''' and '''F''', so the first phonetic group of key letters is '''b, f, p, v'''. Vowels are fluid and are disregarded, as are '''H''' and '''W'''. Each group of key letters is assigned a number. Because key letters that often sound alike receive the same number, the index brings together names that would usually be pronounced alike, with little regard to their actual spelling. Each family name is assigned a Soundex code consisting of the initial letter of the name followed by exactly three key letter group numbers. For example, '''''Stewart''''' = S363 and '''''Stuart''''' = S363.


Modern online search engines that use Soundex do so without displaying the Soundex codes: similar names spelled differently simply appear together in the search results list. From time to time, however, a researcher may need to understand Soundex codes in order to use one of the older Soundex indexes on microfilm.


*Every Soundex code consists of a letter and three numbers, such as D432. 
*The letter is always the first letter of the name. For example, '''Clausen''' = C425, and '''Klausen''' = K425. 
*After the first letter, disregard vowels ('''a, e, i, o, u,''' and '''y''') and ignore the consonants '''h''' and '''w'''. 
*Numbers are assigned to the remaining letters of the name according to the table of ''Soundex Key Letter Codes'' shown below. 
*Zeroes are added at the end if necessary to produce a four-character code. Excess letters are disregarded if they would produce a code longer than four characters. For example, '''Lee''' = L000, and '''Christopherson''' = C623.
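
The basic rules above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation. The names <code>KEY_LETTER_CODES</code> and <code>basic_soundex</code> are invented for this example, and the letter groups are assumed to be the standard American Soundex assignments. The sketch deliberately leaves out the rules for double and side-by-side key letters, so it is only reliable for names that do not trigger them.

```python
# Assumed standard American Soundex groups: position in the tuple = code number.
KEY_LETTER_CODES = {
    letter: str(number)
    for number, group in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1)
    for letter in group
}


def basic_soundex(name: str) -> str:
    """Apply only the basic rules: keep the first letter, code the key
    letters that follow, then pad with zeroes or cut to four characters."""
    name = name.lower()
    if not name:
        return ""
    # Vowels, h and w are not in the table, so they are skipped here.
    digits = [KEY_LETTER_CODES[ch] for ch in name[1:] if ch in KEY_LETTER_CODES]
    return (name[0].upper() + "".join(digits) + "000")[:4]


print(basic_soundex("Lee"))             # L000
print(basic_soundex("Christopherson"))  # C623
print(basic_soundex("Clausen"))         # C425
```

Padding with "000" and then slicing to four characters handles both short names and excess letters in one step.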


{| width="225" cellspacing="1" cellpadding="1" align="center" class="plain FCK__ShowTableBorders"
|-
| valign="middle" align="right" | 1<br> 
| &nbsp;
| b, f, p, v <br>
|-
===== Additional Rules  =====


*'''Double key letters''' should be treated as one letter. For example, '''Gutierrez''' = G362. 
*'''Side-by-side letters with the same code number''' should be treated as one letter. For example, '''Campbell''' = C514, '''Jackson''' = J250, and '''Pfister''' = P236. 
*'''Vowel key letter separators.''' If a vowel ('''a, e, i, o, u, y''') separates key letters that have the same code number, those key letters should be treated as two letters. For example, '''Tomzak''' = T522, and '''Roses''' = R220. 
*'''H or W key letter separators.''' If an '''h''' or '''w''' separates key letters that have the same code number, those key letters should be treated as one letter. For example, '''Ashcroft''' = A261, and '''Carwruth''' = C630. 
*'''Names with prefixes''', such as Van, Con, De, Di, La, or Le, are coded both with and without the prefix because the name might be listed under either code. Note, however, that Mc and Mac are not considered prefixes. For example, '''Van Deusen''' = V532 or D250.
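
Taken together, the basic and additional rules can be sketched in Python as shown here. This is an illustrative sketch under stated assumptions, not an official implementation. The names <code>CODES</code>, <code>PREFIXES</code>, <code>soundex</code>, and <code>soundex_variants</code> are invented for this example. The letter groups are assumed to be the standard American Soundex assignments, letters outside a to z are treated like vowels, and a prefix is only recognized when it is written as a separate word.

```python
# Assumed standard American Soundex groups: position in the list = code number.
CODES = {}
for number, group in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for letter in group:
        CODES[letter] = str(number)

# Prefixes named in the rule above; Mc and Mac are intentionally absent.
PREFIXES = ("van", "con", "de", "di", "la", "le")


def soundex(name: str) -> str:
    """Return the four-character code for a name, or "" if it has no letters."""
    letters = [ch for ch in name.lower() if ch.isalpha()]
    if not letters:
        return ""
    digits = []
    # The first letter's own code counts when checking side-by-side letters.
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate key letters
        code = CODES.get(ch, "")  # vowels (and unknown letters) give ""
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets this, so a repeated code counts again
    return (letters[0].upper() + "".join(digits) + "000")[:4]


def soundex_variants(name: str) -> list[str]:
    """Code a name with its prefix and, if it has one, without it."""
    variants = [soundex(name)]
    words = name.split()
    if len(words) > 1 and words[0].lower() in PREFIXES:
        variants.append(soundex(" ".join(words[1:])))
    return variants


print(soundex("Ashcroft"))             # A261
print(soundex("Tomzak"))               # T522
print(soundex("Pfister"))              # P236
print(soundex_variants("Van Deusen"))  # ['V532', 'D250']
```

Tracking the previous code in a single variable covers double letters, side-by-side letters, and both separator rules: vowels reset it, while '''h''' and '''w''' leave it unchanged.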


==== More Soundex Examples  ====
| ''Name'' 
| ''Code'' 
| width="8%" | &nbsp;
| ''Name'' 
| ''Code'' 
| width="8%" | &nbsp;
| ''Name'' 
| ''Code'' 
| width="8%" | &nbsp;
| ''Name'' 
| ''Code''
=== Related Content  ===


*Rick Parsons, ''<ref>[http://west-penwith.org.uk/misc/soundex.htm Soundex - the True Story]</ref>'' (http://west-penwith.org.uk/misc/soundex.htm : accessed 30 July 2008). 
*''<ref>[http://www.archives.gov/genealogy/census/soundex.html The Soundex Indexing System]</ref>'' ''The National Archives'' (http://www.archives.gov/genealogy/census/soundex.html : accessed 30 July 2008). 
*Gary Mokotoff, "<ref>[http://www.avotaynu.com/soundex.html Soundexing and Genealogy]</ref>" ''Avotaynu'' (http://www.avotaynu.com/soundex.html : accessed 30 July 2008). 
*<ref>United States Census Indexes</ref> FamilySearch Wiki article. 
*<ref>[[Finding a Person in the 1930 Census (Even Without An Index)|Finding a Person in the 1930 Census (Even Without an Index)]]</ref> FamilySearch Wiki article.