Paper Title

Mathematical Content Browsing for Print-Disabled Readers Based on Virtual-World Exploration and Audio-Visual Sensory Substitution

Authors

Rynhardt Kruger, Febe de Wet, Thomas Niesler

Abstract

Documents containing mathematical content remain largely inaccessible to blind and visually impaired readers because they are predominantly published as untagged PDFs, which do not include the semantic data necessary for effective accessibility. We present a browsing approach for print-disabled readers aimed specifically at such mathematical content. The approach combines the navigational mechanisms often used to explore the virtual worlds of text adventure games with audio-visual sensory substitution for graphical content. The relative spatial placement of the elements of an equation is represented as a virtual world, so that the reader can navigate from element to element. Text elements are announced conventionally using synthesised speech, while graphical elements, such as roots and fraction lines, are rendered using a modification of the vOICe algorithm. The virtual world allows the reader to discover the spatial structure of the equation interactively, while the rendition of graphical elements as sound allows the shape and identity of elements that cannot be synthesised as speech to be discovered and recognised. The browsing approach was evaluated by eleven blind and fourteen sighted participants in a user trial that included the identification of twelve equations extracted from PDF documents. Overall, equations were identified completely correctly in 78% of cases (74% and 83% for blind and sighted subjects, respectively). If partial correctness is considered, the performance is substantially higher. We conclude that a spatial model represented as a virtual world, combined with audio-visual sensory substitution for non-textual elements, can be an effective way for blind and visually impaired readers to read currently inaccessible mathematical content in PDF documents.
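To make the idea of rendering a graphical element as sound more concrete, the following is a minimal sketch of the basic column-scan principle commonly associated with the vOICe: the image is scanned from left to right over time, vertical position is mapped to pitch, and pixel brightness is mapped to loudness. It is not the modified algorithm used by the authors, whose details are not given in the abstract. The function name `sonify`, its parameters, and the frequency range are all assumptions chosen for illustration.

```python
import math


def sonify(image, duration=1.0, sample_rate=8000, f_low=500.0, f_high=5000.0):
    """Turn a small grayscale image into a list of audio samples.

    Hypothetical helper, not the authors' implementation.
    image: list of rows (row 0 is the top), each a list of brightness
    values in [0, 1]. Columns are played left to right; higher rows
    produce higher pitches; brighter pixels produce louder sinusoids.
    """
    n_rows = len(image)
    n_cols = len(image[0])

    # One frequency per row, exponentially spaced, top row highest.
    if n_rows == 1:
        freqs = [f_low]
    else:
        freqs = [
            f_high * (f_low / f_high) ** (r / (n_rows - 1))
            for r in range(n_rows)
        ]

    samples_per_col = int(duration * sample_rate / n_cols)
    samples = []
    for c in range(n_cols):
        for i in range(samples_per_col):
            t = (c * samples_per_col + i) / sample_rate
            # Sum one sinusoid per row, weighted by pixel brightness.
            value = sum(
                image[r][c] * math.sin(2 * math.pi * freqs[r] * t)
                for r in range(n_rows)
            )
            # Dividing by the row count keeps every sample in [-1, 1].
            samples.append(value / n_rows)
    return samples


# A fraction line: a single bright horizontal bar in the middle row.
fraction_line = [
    [0.0] * 8,
    [0.0] * 8,
    [1.0] * 8,
    [0.0] * 8,
    [0.0] * 8,
]
bar_samples = sonify(fraction_line)
```

Under this mapping, a horizontal bar such as a fraction line is heard as one steady tone for the whole scan, whereas a shape with diagonal strokes, such as a root sign, would be heard as a pitch that changes over time.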
