电脑桌面
添加小米粒文库到电脑桌面
安装后可以在桌面快捷访问

中英文献翻译:语音识别speechrecognitionVIP免费

中英文献翻译:语音识别speechrecognition_第1页
1/20
中英文献翻译:语音识别speechrecognition_第2页
2/20
中英文献翻译:语音识别speechrecognition_第3页
3/20
中英文献翻译:语音识别speechrecognitionSpeechRecognitionVictorZue,RonCole,&WayneWardMITLaboratoryforComputerScience,Cambridge,Massachusetts,USAOregonGraduateInstituteofScience&Technology,Portland,Oregon,USACarnegieMellonUniversity,Pittsburgh,Pennsylvania,USA1DefiningtheProblemSpeechrecognitionistheprocessofconvertinganacousticsignal,capturedbyamicrophoneoratelephone,toasetofwords.Therecognizedwordscanbethefinalresults,asforapplicationssuchascommands&control,dataentry,anddocumentpreparation.Theycanalsoserveastheinputtofurtherlinguisticprocessinginordertoachievespeechunderstanding,asubjectcoveredinsection.Speechrecognitionsystemscanbecharacterizedbymanyparameters,someofthemoreimportantofwhichareshowninFigure.Anisolated-wordspeechrecognitionsystemrequiresthatthespeakerpausebrieflybetweenwords,whereasacontinuousspeechrecognitionsystemdoesnot.Spontaneous,orextemporaneouslygenerated,speechcontainsdisfluencies,andismuchmoredifficulttorecognizethanspeechreadfromscript.Somesystemsrequirespeakerenrollment---ausermustprovidesamplesofhisorherspeechbeforeusingthem,whereasothersystemsaresaidtobespeaker-independent,inthatnoenrollmentisnecessary.Someoftheotherparametersdependonthespecifictask.Recognitionisgenerallymoredifficultwhenvocabulariesarelargeorhavemanysimilar-soundingwords.Whenspeechisproducedinasequenceofwords,languagemodelsorartificialgrammarsareusedtorestrictthecombinationofwords.Thesimplestlanguagemodelcanbespecifiedasafinite-statenetwork,wherethepermissiblewordsfollowingeachwordaregivenexplicitly.Moregenerallanguagemodelsapproximatingnaturallanguagearespecifiedintermsofacontext-sensitivegrammar.1Onepopularmeasureofthedifficultyofthetask,combiningthevocabularysizeandthelanguagemodel,isperplexity,looselydefinedasthegeometricmeanofthenumberofwordsthatcanfollowawordafterthelanguagemodelhasbeenapplied(seesectionforadiscussionoflanguagemodelingingeneralandperplexityinparticular).Finally,therearesomeexternalparametersthatcanaffectspeechrecognitionsystemperformance,includingthecharacteristicsoftheenvironmentalnoiseandthetypeandtheplacementofthemicrophone.ParametersRangeSpeakingModeIsolatedwordstocontinuousspeechSpeakingStyleReadspeechtospontaneousspeechEnrollmentSpeaker-dependenttoSpeaker-independentVocabularySmall(<20words)tolarge(>20,000words)LanguageModelFinite-statetocontext-sensitivePerplexitySmall(<10)tolarge(>100)SNRHigh(>30dB)tolaw(<10dB)TransducerVoice-cancellingmicrophonetotelephoneTable:TypicalparametersusedtocharacterizethecapabilityofspeechrecognitionsystemsSpeechrecognitionisadifficultproblem,largelybecauseofthemanysourcesofvariabilityassociatedwiththesignal.First,theacousticrealizationsofphonemes,thesmallestsoundunitsofwhichwordsarecomposed,arehighlydependentonthecontextinwhichtheyappear.Thesephoneticvariabilitiesareexemplifiedbytheacousticdifferencesofthephoneme,Atwordboundaries,contextualvariationscanbequitedramatic---makinggasshortagesoundlikegashshortageinAmericanEnglish,anddevoandaresoundlikedevandareinItalian.Second,acousticvariabilitiescanresultfromchangesintheenvironmentaswellasinthepositionandcharacteristicsofthetransducer.Third,within-speakervariabilitiescanresultfromchangesinthespeaker'sphysicalandemotionalstate,speakingrate,orvoicequality.Finally,differencesinsociolinguisticbackground,dialect,andvocaltractsizeandshapecancontributetoacross-speakervariab...

1、当您付费下载文档后,您只拥有了使用权限,并不意味着购买了版权,文档只能用于自身使用,不得用于其他商业用途(如 [转卖]进行直接盈利或[编辑后售卖]进行间接盈利)。
2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。
3、如文档内容存在违规,或者侵犯商业秘密、侵犯著作权等,请点击“违规举报”。

碎片内容

中英文献翻译:语音识别speechrecognition

确认删除?
VIP
微信客服
  • 扫码咨询
会员Q群
  • 会员专属群点击这里加入QQ群
客服邮箱
回到顶部