Wednesday, August 30, 2006

Corpora dictionary data storage algorithm

The binary format is difficult to search.

It seems that each phoneme should be treated as a software object. The object should contain properties holding the cepstral acoustic data in vector format, using a standardized identity matrix.
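A rough sketch of what such a phoneme object might look like, in Python. The class name `Phoneme`, its fields, and the `distance` method are all hypothetical, made up for illustration. The sketch also assumes that the identity matrix acts as the weighting matrix when two cepstral vectors are compared, which reduces to plain Euclidean distance; that is one reading of the idea, not a settled design.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class Phoneme:
    """Hypothetical phoneme object carrying its cepstral data as a vector."""
    symbol: str                                          # label such as "AA" (illustrative)
    cepstrum: List[float] = field(default_factory=list)  # cepstral coefficients

    def distance(self, other: "Phoneme") -> float:
        # Assumption: with an identity weighting matrix, the weighted distance
        # between two cepstral vectors is the ordinary Euclidean distance.
        if len(self.cepstrum) != len(other.cepstrum):
            raise ValueError("cepstral vectors must have the same length")
        return math.sqrt(
            sum((a - b) ** 2 for a, b in zip(self.cepstrum, other.cepstrum))
        )
```

With this in place, two phonemes can be compared directly through their objects, without going back to the binary dictionary.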


The phoneme objects should be related to database tables containing the weighted transitions for each phoneme. Lookup will then not require parsing the full dictionary, merely the datasets related to the object. The database should construct relational tables for phonemes when new entries are added.
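One possible way to lay this out, sketched with Python's built-in `sqlite3` module. The table names (`phoneme`, `transition`), the column names, and the helper functions are assumptions for illustration only. The sketch also assumes that a transition weight is simply the relative frequency of one phoneme following another across the dictionary entries added so far.

```python
import sqlite3

# Hypothetical schema: one row per phoneme, one row per observed transition.
SCHEMA = """
CREATE TABLE IF NOT EXISTS phoneme (
    id     INTEGER PRIMARY KEY,
    symbol TEXT UNIQUE NOT NULL
);
CREATE TABLE IF NOT EXISTS transition (
    from_id INTEGER NOT NULL REFERENCES phoneme(id),
    to_id   INTEGER NOT NULL REFERENCES phoneme(id),
    n       INTEGER NOT NULL DEFAULT 0,
    PRIMARY KEY (from_id, to_id)
);
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure the tables exist."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def _phoneme_id(conn, symbol):
    """Return the row id for a phoneme, creating the row if it is new."""
    conn.execute("INSERT OR IGNORE INTO phoneme (symbol) VALUES (?)", (symbol,))
    row = conn.execute(
        "SELECT id FROM phoneme WHERE symbol = ?", (symbol,)
    ).fetchone()
    return row[0]


def add_entry(conn, phonemes):
    """Add one dictionary entry, given as its phoneme sequence, and update
    the transition counts for each adjacent pair."""
    ids = [_phoneme_id(conn, s) for s in phonemes]
    for a, b in zip(ids, ids[1:]):
        conn.execute(
            "INSERT OR IGNORE INTO transition (from_id, to_id, n) VALUES (?, ?, 0)",
            (a, b),
        )
        conn.execute(
            "UPDATE transition SET n = n + 1 WHERE from_id = ? AND to_id = ?",
            (a, b),
        )
    conn.commit()


def transitions(conn, symbol):
    """Return {next_symbol: weight} for one phoneme, reading only the rows
    related to that phoneme rather than the whole dictionary."""
    rows = conn.execute(
        "SELECT p2.symbol, t.n FROM transition t "
        "JOIN phoneme p1 ON p1.id = t.from_id "
        "JOIN phoneme p2 ON p2.id = t.to_id "
        "WHERE p1.symbol = ?",
        (symbol,),
    ).fetchall()
    total = sum(n for _, n in rows)
    return {s: n / total for s, n in rows}
```

For example, after `add_entry(conn, ["K", "AE", "T"])` and `add_entry(conn, ["K", "AA", "R"])`, calling `transitions(conn, "K")` touches only the rows whose source phoneme is `K` and gives equal weight to `AE` and `AA`.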
