The gibberish score is a pending Google patent from 2009, which is supposed to make it easier for the search engine, to distinguish unique content more specifically from thin content, copied content, and webspam.
Gibberish literally means unintelligible or meaningless speech or writing. It is part of a language that does not represent meaningful significance. In practice, gibberish is used in vocal exercises before concerts, in theaters when a speaking crowd is to be simulated, for example. The gibberish score is therefore supposed to recognize meaningless content.
Ever since the implementation of the influential Panda and Penguin updates, Google has massively stepped up its efforts in fighting webspam in its search result lists. With the Hummingbird update in autumn 2013, the search engine has underlined once again how precisely it can evaluate queries to deliver appropriate search results. The patent-pending gibberish score is intended to help to distinguish “good” content from “bad” content that has no added user value.
The gibberish score of a document is determined according to specifications based on two different approaches:
The evaluation of the two “scores” described here, comprises the gibberish score of a web document. The consequence of too many gibberish elements in a text is the devaluation of the webpage in question in the rankings for a specific search term.
The announcement that Google had filed a patent for the gibberish score fits into the debate about WDF*IDF optimization which was especially intense in 2013. According to the TF*IDF principle, text is no longer created only based on keyword density, but also the meaning of the individual terms in relation to their use on other websites which are relevant to the keyword being optimized for.
Thus, the task of on-page optimization is becoming more complex especially in content creation. Content generation can be easily outsourced to a company such as Demand Media. The gibberish score can serve to control. Massively keyword optimized content, lightly rewritten content, and machine-generated text are identified more easily as such through the use of the gibberish score. The Google patent is therefore a further step towards high quality websites that are created for visitors and not for the search engine.