SALSA Version 1.0: A Speech-based Web Browser for Hong Kong English

Pascale Fung, Chi Shun Cheung, Kwok Leung Lam, Wai Kat Liu, Yuen Yee Lo

Research output: Contribution to conference › Conference Paper › peer-review

Abstract

In this paper, we present a prototype speech-based Web browser, SALSA1.0, and describe some of the research issues we need to address while building this system for Hong Kong users. SALSA1.0 allows the user to speak English command words as well as partial or complete link names on any page. The main research issues in building SALSA1.0 are (1) how to handle large accent variations and mixed-language input and (2) how to handle unknown words, especially proper names, in Web links. The recognition engine for SALSA1.0 is trained on WSJ data and then retrained on a small amount of Hong Kong-accented WSJ data to handle accent variations. An edit-distance algorithm replaces each unknown word with the closest known word in the word network for recognition. With these methods, the link name recognition rate is 91.20% for links without unknown words and 82.40% for links with unknown words. SALSA is currently being developed into a multilingual, natural language-based Intranet service provider for HKUST campus information access.
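
The unknown-word step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a character-level Levenshtein distance with unit costs (the abstract does not say whether the distance is computed over letters or phonemes), and the function names and the sample vocabulary are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings, with unit costs (assumed)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def closest_known_word(word: str, vocabulary: set) -> str:
    """Return word if it is known, else the nearest vocabulary entry."""
    word = word.lower()
    if word in vocabulary:
        return word
    # Sorting first makes tie-breaking deterministic (alphabetical order).
    return min(sorted(vocabulary), key=lambda known: edit_distance(word, known))


def normalize_link(link_name: str, vocabulary: set) -> list:
    """Map every word of a link name onto the known vocabulary."""
    return [closest_known_word(w, vocabulary) for w in link_name.split()]


# Hypothetical vocabulary standing in for the recognizer's word network.
vocab = {"campus", "library", "news"}
normalized = normalize_link("Campuss News", vocab)
```

Here the unknown token "Campuss" is one deletion away from "campus", so it is mapped to that entry, while "News" is already known and only lowercased. A real system would more likely compare pronunciations than spellings, which this sketch does not attempt.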

Original language: English
Publication status: Published - 1998
Event: 5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia
Duration: 30 Nov 1998 to 4 Dec 1998

Conference

Conference: 5th International Conference on Spoken Language Processing, ICSLP 1998
Country/Territory: Australia
City: Sydney
Period: 30/11/98 to 4/12/98

Bibliographical note

Publisher Copyright:
© 1998. 5th International Conference on Spoken Language Processing, ICSLP 1998. All rights reserved.
