How does Google recognize 2 words without spaces?

I want to understand how Google does not fill the gap between two words. For example, there are 2 words - word1 and word2. I write in the search field "word1word2", it says that you mean the word word1 word2 or just understand "word1 word2". Any information, what data structure and algorithm do they use? I see in this answer How to split text without spaces into a list of words? , it is proposed to use the trie data structure.

+5
source share
3 answers

In creating a candidate from the spell corrector, you allow as the possibility of skipping space, just as you allow the omission of other letters .... Perhaps see the spelling correction vocabulary here: http://nlp-class.org/ [sorry, self-promotion] or Introduction by Peter Norwig: http://norvig.com/spell-correct.html

+4
source

Maybe they are looking in the dictionary using the prefix of the entered word?

+1
source

, script ( ajax http://net.tutsplus.com/tutorials/javascript-ajax/adding-a-jquery-auto-complete-to-your-google-custom-search-engine/)

Basically you check the words in a dictionary. Space should not be a condition for verifying a word, but simply an opportunity. For example, a simple algorithm (really simple) would be as follows: “a few words” you check the letter 3 first, nothing? Then you check the first 4 ...

Here are some explanations on Google search: https://developers.google.com/search-appliance/documentation/60/admin_searchexp/ce_improving_search

Maybe it can help here too: http://tm.durusau.net/?cat=1106

+1
source

All Articles