Skip to main navigation Skip to search Skip to main content

Question classification by approximating semantics

  • Guangyu Feng
  • , Kun Xiong
  • , Yang Tang
  • , Anqi Cui
  • , Jing Bai
  • , Hang Li
  • , Qiang Yang
  • , Ming Li

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

A central task of computational linguistics is to decide if two pieces of texts have similar meanings. Ideally, this depends on an intuitive notion of semantic distance. While this semantic distance is most likely unde nable and uncomputable, in practice it is approximated heuristically, consciously or unconsciously. In this paper, we present a theory, and its implementation, to approximate the elusive semantic distance by the well-de ned information distance. It is mathematically proven that any computable approximation of the intuitive concept of semantic distance is covered" by our theory. We have implemented our theory to question answering (QA) and performed experiments based on data extracted from over 35 million question-answer pairs. Experiments demonstrate that our initial implementation of the theory produces convincingly fewer errors inspecification compared to other academic models and commercial systems.

Original languageEnglish
Title of host publicationWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
PublisherAssociation for Computing Machinery, Inc
Pages407-417
Number of pages11
ISBN (Electronic)9781450334730
DOIs
Publication statusPublished - 18 May 2015
Event24th International Conference on World Wide Web, WWW 2015 - Florence, Italy
Duration: 18 May 201522 May 2015

Publication series

NameWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web

Conference

Conference24th International Conference on World Wide Web, WWW 2015
Country/TerritoryItaly
CityFlorence
Period18/05/1522/05/15

Keywords

  • Information distance
  • Question answering
  • Semantic distance
  • Textspecification

Fingerprint

Dive into the research topics of 'Question classification by approximating semantics'. Together they form a unique fingerprint.

Cite this