Friday, August 25, 2006

Hub Needs Speech Recognition Grammar Specification (SRGS) recognizer

The Speech Recognition Grammar Specification is a W3C standard for XML markup of speech recognition data. It is developing alongside VoiceXML and the voice browser projects for the web.

The current version of Sphinx uses the more convenient Java Speech Grammar Format (JSGF) provided by Sun Microsystems.

JSGF simply doesn't cover data retention and lookup. It is meant for recognizers, not for dictionaries. SRGS has a good deal of markup that allows several methods of lookup. Also, because it is XML, standardized corpora can be developed with a data structure that is shared among the data sources. This will facilitate large and robust community development.
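To make the lookup point a bit more concrete, here is a rough sketch of what reading an SRGS XML grammar into a lookup table could look like. The grammar itself (the `command` rule, its phrases and its tag values) and the `load_rules` helper are made up for illustration; only the element names (`grammar`, `rule`, `one-of`, `item`, `tag`) and the namespace come from the SRGS spec. It uses nothing but Python's standard XML parser, so treat it as a sketch under those assumptions and not as how the Hub or Sphinx would actually do it.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SRGS 1.0 specification.
SRGS_NS = "http://www.w3.org/2001/06/grammar"

# Hypothetical grammar, written only for this example.
GRAMMAR_XML = """
<grammar xmlns="http://www.w3.org/2001/06/grammar"
         version="1.0" xml:lang="en-US" root="command" mode="voice">
  <rule id="command" scope="public">
    <one-of>
      <item>open <tag>OPEN</tag></item>
      <item>close <tag>CLOSE</tag></item>
      <item>save <tag>SAVE</tag></item>
    </one-of>
  </rule>
</grammar>
"""


def load_rules(xml_text):
    """Return (root rule id, {rule id: {phrase: tag text}}).

    Only handles the simple shape used above: a rule whose direct
    child is a <one-of> list of <item> elements, each with plain
    text and an optional <tag>.
    """
    root = ET.fromstring(xml_text.strip())
    ns = {"g": SRGS_NS}
    rules = {}
    for rule in root.findall("g:rule", ns):
        alternatives = {}
        for item in rule.findall("./g:one-of/g:item", ns):
            phrase = (item.text or "").strip()
            tag = item.find("g:tag", ns)
            has_tag = tag is not None and tag.text
            alternatives[phrase] = tag.text.strip() if has_tag else None
        rules[rule.get("id")] = alternatives
    return root.get("root"), rules


root_rule, rules = load_rules(GRAMMAR_XML)
# Look up the tag attached to a recognized phrase.
print(root_rule, rules[root_rule].get("close"))
```

Because the grammar is ordinary XML, any data source that follows the same structure can be loaded and queried the same way, which is the sharing argument made above.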

So, if you will, please consider for a moment the needs of algorithm development. Sun has left the data structuring and lookup to future developers (me). Sphinx wants to ignore the need for a more complex system and instead uses the binary ARPA format for data storage. Everyone admits that this doesn't produce quick, dynamic recognition results.

I admit it's not easy, but then there are lots of HTML jobs out there if you don't like the rough stuff...
