Finding meaning behind the words
Page 4


...Another key to the extraction of meaning from speech is successfully finding “phones,” the most elementary characteristics of speech, according to the University of Washington’s Signal, Speech and Language Interpretation Lab. Phones of the world’s languages can be described and uniquely identified by a compact set of approximately 30 features – the sound that we made for the letter “d”, for example, is a universal one that involves pressing the tongue against the palate. But merely add a single vowel and you have additional variability. Pronounce “di” and “du” and you can see the difference. By creating software that can identify speech by breaking it down into its smallest component parts, it will be much easier to write recognition software for specific languages, built from the ground up.

“Meaning is something that we as humans take for granted but is not easy to define, much less capture,” says Alex Acero, a senior researcher at Microsoft and manager of the company’s speech technology group. Before we will have robots that understand whom we’re talking about when we say “her” or be able to distinguish sarcasm, software will need to identify and understand many variables. Filtering out ambient noise, not having to define a word’s meaning every time it is used, or simply having a computer say “I don’t understand” when meaning is uncertain are just a few of the hurdles to overcome. r

photo credit: www.freeimages.co.uk