




已閱讀5頁(yè),還剩37頁(yè)未讀, 繼續(xù)免費(fèi)閱讀
版權(quán)說(shuō)明:本文檔由用戶(hù)提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
關(guān)于數(shù)學(xué)公式語(yǔ)音 項(xiàng)目的調(diào)研,報(bào)告人:彭云輝 2010-4-8,綱要,國(guó)外關(guān)于數(shù)學(xué)語(yǔ)音的一些相關(guān)的研究 這些項(xiàng)目表示數(shù)學(xué)公式所采用的語(yǔ)言/格式 相關(guān)項(xiàng)目的大體思路及意圖 如何消除歧義 項(xiàng)目的優(yōu)點(diǎn)和局限性及對(duì)項(xiàng)目的改進(jìn)意見(jiàn),主要相關(guān)項(xiàng)目及對(duì)應(yīng)機(jī)構(gòu)或人物,另外,還有一些機(jī)構(gòu)、團(tuán)體對(duì)數(shù)學(xué)公式語(yǔ)音作了深入的研究,并取得了一定的成果。如University of California at Berkeley(How can we speak math/Math Speak & Write, a Computer Program to Read and Hear Mathematical Input) 和 School of Computing, Dublin City University (Mathematics: How and What to Speak)等。,RETURN,數(shù)學(xué)表達(dá)式源語(yǔ)言,RETURN,an excellent prototype for speaking mathematics LaTeX to audio documents. Can speak both literary texts and highly technical documents that contain complex mathematics. the adequacy of the audio rendering depends on how well the electronic document captures the essential internal structure of the information. produced a structured representation and an audio formatting language (AFL) to provide an interactive environment for listening to and browsing technical documents. uses the Emacs(文本編輯器) front-end (Linux).,有關(guān)AsTeR( Audio System for Technical Readings ) (Raman 1994),AsTeR(續(xù)),creates an internal representation easier,used to help the audio rendering,Mathtalk的大體思路及意圖 1. a set of rules to insert prosodic cues into spoken algebraic expressions. 2. analyzed the way mathematics teachers speak mathematical expressions and integrated these natural voice inflections(音調(diào)變化). 3. only insertion of prosodic cues (pitch, amplitude, and pauses) into computer-spoken mathematical expressions ,none insertion of lexical cues.,creates the necessary HTML/XML tags for visually-impaired and blind users to use their current screen-reading tools (e.g. JAWS and Window-Eyes for Windows) to read HTML and MathML/XML pages that contain math expressions, to read them in a spoken language.,This markup is not displayed in the browser. Only the MathML visual markup, or a PNG image, or a LiveMath Plug-In interactive image - whatever the author intended, is shown. The “MathSpeak This“ function makes it possible to hear the expression read during the creation/editing process,大體思路: 1. deriving Braille and Spoken Output from LaTeX Documents 2. render spoken mathematics from MathML using prosodic features such as pauses and speaking rate . 3. the use of prosody in synthesized speech to indicate nesting structure. 主要意圖: To take a subset of LaTeX and produce both Braille and Spoken out from it. To accurately model a document and to present this to the blind user using a simple and intuitive interface. To harness the capabilities of synthetic speech devices to give more meaningful spoken output to the user.,TechRead的大體思路及意圖,AudioMath的大體思路及意圖 made use of its own database of prosodic rules in the generation of the spoken expressions. Available in 4 different ways: - ActiveX DLL - .NET component - CGI interface - Executable EXE,Auto-Discovery (the “brain” of the operation that recognizes or identifies elements in the document and calls the respective conversion modules ) Numerals (conversion of several types of numeric forms) Abbreviations Acronyms Network References Mathematical (MathML expressions ),6 modules for the conversion part,AudioMath設(shè)計(jì)流程圖,MathPlayer,a plug-in to Microsoft Internet Explorer (IE) and Adobe Acrobat/Reader that renders MathML visually. is able to dynamically display a mathematical expression according to its font and the color set, users can choose the most suitable font or color scheme for their reading needs. For example, visually impaired readers are likely to set a large font and high color contrast.,上述公式在MathPlayer中的讀法為: cap U bar sup h equals one minus exponent open minus fraction 8 cap T sup h over end fraction close,讀法: equals ln open fraction n over s end close plus open fraction k sup h over k prime sup h end fraction close ln open s close minus zero point seven five plus open two l z minus z squared close fraction k sup h over q sup w end fraction.,MathSpeak Project的大體思路及意圖 The project is one of the proposed methods, consisting of a group of rules to dictate mathematical contents. However it is not a standard and it is intended to serve blind people that want to transcribe their documents into Nemeth Code 18, and later on into Braille.,RETURN,如何消除歧義,兩大策略:,Use of lexical indicators (a) x plus begin fraction one over x end fraction minus one (b) begin fraction xplusone over x end fraction minus one Use of prosodic indicators(pauses, modifications of pitch and tempo, rhythm and tone) (a) x plus one over x minus one (b) “ xplus one over x minus one,在消除歧義這一部分,MathPlayer沒(méi)什么優(yōu)勢(shì),而AudioMath在這一方面做得很不錯(cuò),其余的一些相關(guān)軟件也沒(méi)什么出眾的地方,下面著重談?wù)凙udioMath,AudioMath(葡萄牙語(yǔ)),Lexical Square root of power base a exponent two, end of power, plus power base b exponent two, end of power, end of radicand Prosodic Square root of (LP) a squared (SP) plus b squared (LP) end of radicand,AudioMath tone rules:,1- Rising tone: used when a lower hierarchical level is starting. (root of) 2- Falling and Rising tone: used to mark the smaller separating pause. (a squared) 3- Falling tone: used when level is ended. (b squared) 4- Emphatic Falling tone: used at the end of the expression that simultaneously is the higher hierarchical level (end of radicand).,LP,SP,LP,RETURN,AudioMath優(yōu)點(diǎn):,supports usermode options. An example : 1.25 one point twenty five OR one point two five,Future Work of AudioMath - Complete the support for MathML Content Markup - Study in more detail mathematical prosody - Implement a proper blind tool - Add more languages - Enhancements on XHTML support - Implement SAPI, SSML support for TTS technologies,MathPlayer局限性,the use of tables and the representation of matrices and the possibility of some ambiguous readings no math formulae navigation support. gets complicated with complex math expressions no provision for any kind of user adapted preferences scheme(usermode) has ambiguous rendering in some mathematical expressions. Does not use prosody to render mathematical expressions by speech output. It generates text strings made up of the names of mathematical symbols and commas and periods to set pauses.,MathPlayer優(yōu)勢(shì),allows web browsers users to copy a MathML expression and paste it in a MathML-aware program. This is particularly useful for computation, but might also be useful when used in conjunction with other software aimed at making math accessible (e.g. the LAMBDA system) or with mainstream applications used to process scientific documents (e.g. MathType or Scientific Notebook).,Changes in MathPlayer 2.2,MathPlayer 2.2 (released February 2010) is an upgrade and includes the following: Significantly improved font handling and rendering: Improved support for STIX, Cambria, and other Unicode fonts. Improvements for anti-aliased rendering. Better protection against fonts that contain errors in their tables. More characters are displayed. Improved performance when Internet Explorers zoom is not 100%. Improved compatibility with ASCIIMathML. Fixed bugs with content MathML (handling of , “Copy MathML“),Future work of MathPlayer,MathPlayers speech rules are based upon a pattern matcher/rule system. The rules are able to specify synchronization points and prosody in addition to text to speak. The rules provide a great deal of flexibility and allow users to match structures such as limits and integrals so that they are spoken in the customary manner rather than treating them as general expressions with limits and/or scripts.,Future work of MathPlayer (續(xù)),The downside to this power is that
溫馨提示
- 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶(hù)所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶(hù)上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶(hù)上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶(hù)因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 網(wǎng)絡(luò)普法考試試題及答案
- 工程招投標(biāo)管理與協(xié)議執(zhí)行流程規(guī)范
- 浙江國(guó)企招聘2025溫州市甌海旅游投資集團(tuán)有限公司及下屬子公司招聘10人筆試參考題庫(kù)附帶答案詳解
- 2025福建福州市建筑大數(shù)據(jù)技術(shù)有限公司招聘4人筆試參考題庫(kù)附帶答案詳解
- 2025河南鄭州二七區(qū)一國(guó)企招聘各部門(mén)人員9人筆試參考題庫(kù)附帶答案詳解
- 2025江蘇徐州東創(chuàng)新能源科技有限公司招聘19人筆試參考題庫(kù)附帶答案詳解
- 2025年合肥興泰金融控股(集團(tuán))有限公司招聘23人筆試參考題庫(kù)附帶答案詳解
- 2025山東芳蕾玫瑰科技開(kāi)發(fā)有限公司招聘11人筆試參考題庫(kù)附帶答案詳解
- 幼兒園秋游安全教案
- 色彩理論在廣告設(shè)計(jì)中的試題及答案
- 2024年內(nèi)蒙古呼和浩特中考?xì)v史真題卷及答案解析
- 2025年國(guó)投交通控股有限公司招聘筆試參考題庫(kù)含答案解析
- 【MOOC答案】《中國(guó)文化傳承與科技創(chuàng)新》(北京郵電大學(xué))中國(guó)慕課章節(jié)作業(yè)網(wǎng)課答案
- GB/T 45015-2024鈦石膏綜合利用技術(shù)規(guī)范
- 郵政社招筆試題庫(kù)
- 2023-2024學(xué)年北京市海淀區(qū)高二(上)期末語(yǔ)文試卷
- 《真希望你也喜歡自己》房琪-讀書(shū)分享
- 2025年教師資格考試高中物理面試試題與參考答案
- 粵人版(2024新版)七年級(jí)上冊(cè)地理期末復(fù)習(xí)考點(diǎn)背誦提綱
- 《危險(xiǎn)化學(xué)品建設(shè)項(xiàng)目安全設(shè)施設(shè)計(jì)專(zhuān)篇編制導(dǎo)則》編制說(shuō)明
- 化妝品合伙人協(xié)議書(shū)模板
評(píng)論
0/150
提交評(píng)論