<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article"><front><journal-meta><journal-id journal-id-type="publisher-id">INFORMATICA</journal-id><journal-title-group><journal-title>Informatica</journal-title></journal-title-group><issn pub-type="epub">0868-4952</issn><issn pub-type="ppub">0868-4952</issn><publisher><publisher-name>VU</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">inf15107</article-id><article-id pub-id-type="doi">10.15388/Informatica.2004.048</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research article</subject></subj-group></article-categories><title-group><article-title>Specifics of Hidden Markov Model Modifications for Large Vocabulary Continuous Speech Recognition</article-title></title-group><contrib-group><contrib contrib-type="Author"><name><surname>Šilingas</surname><given-names>Darius</given-names></name><email xlink:href="mailto:i5dasi@vaidila.vdu.lt">i5dasi@vaidila.vdu.lt</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><aff id="j_INFORMATICA_aff_000">Department of Applied Informatics, Vytautas Magnus University, Vileikos 8, LT‐3035 Kaunas, Lithuania</aff></contrib-group><contrib-group><contrib contrib-type="Author"><name><surname>Telksnys</surname><given-names>Laimutis</given-names></name><email xlink:href="mailto:telksnys@ktl.mii.lt">telksnys@ktl.mii.lt</email><xref ref-type="aff" rid="j_INFORMATICA_aff_001"/></contrib><aff id="j_INFORMATICA_aff_001">Department of Applied Informatics, Vytautas Magnus University, Recognition Processes Department, Institute of Mathematics and Informatics, Goštauto 12–205, 08663 Vilnius, Lithuania</aff></contrib-group><pub-date pub-type="epub"><day>01</day><month>01</month><year>2004</year></pub-date><volume>15</volume><issue>1</issue><fpage>93</fpage><lpage>110</lpage><history><date date-type="received"><day>01</day><month>07</month><year>2003</year></date></history><abstract><p>Specifics of hidden Markov model‐based speech recognition are investigated. Influence of modeling simple and context‐dependent phones, using simple Gaussian, two and three‐component Gaussian mixture probability density functions for modeling feature distribution, and incorporating language model are discussed. Word recognition rates and model complexity criteria are used for evaluating suitability of these modifications for practical applications. Development of large vocabulary continuous speech recognition system using HTK toolkit and WSJCAM0 English speech corpus is described. Results of experimental investigations are presented.</p></abstract><kwd-group><label>Keywords</label><kwd>large vocabulary continuous speech recognition</kwd><kwd>hidden Markov model</kwd><kwd>Viterbi recognition</kwd><kwd>beam search</kwd><kwd>context‐dependent phones</kwd><kwd>Gaussian mixture</kwd><kwd>language modeling</kwd><kwd>HTK</kwd><kwd>WSJCAM0</kwd></kwd-group></article-meta></front></article>