Automatic Recognition of Focus and Interrogative Word in Chinese Question for Classification
- Zhichang Zhang
- Yu Zhang
- Ting Liu
- Sheng Li
Abstract
Question classification is one of the most important components in a question answering (QA) system. When there are fewer features in a question can be used for classification, the interrogative word and focus in question are critical features. Most previous studies in question classification used heuristic rules to identify the focus and interrogative word in question. In this paper, a statistical method is explored to automatically label them for Chinese question using condition random fields (CRFs) model. The features for CRFs model are extracted from word segmentation, part-of-speech (POS) tagging, named entity recognition, and dependency parsing results. A knowledge base HowNet is also used. The experimental results show that the precision for interrogative word recognition is 98.97% and 90.85% of focus can be correctly recognized in a free available Chinese question data set.
- Full Text: PDF
- DOI:10.5539/cis.v3n1p168
Journal Metrics
WJCI (2022): 0.636
Impact Factor 2022 (by WJCI): 0.419
h-index (January 2024): 43
i10-index (January 2024): 193
h5-index (January 2024): N/A
h5-median(January 2024): N/A
( The data was calculated based on Google Scholar Citations. Click Here to Learn More. )
Index
- Academic Journals Database
- BASE (Bielefeld Academic Search Engine)
- CiteFactor
- CNKI Scholar
- COPAC
- CrossRef
- DBLP (2008-2019)
- EBSCOhost
- EuroPub Database
- Excellence in Research for Australia (ERA)
- Genamics JournalSeek
- Google Scholar
- Harvard Library
- Infotrieve
- LOCKSS
- Mendeley
- PKP Open Archives Harvester
- Publons
- ResearchGate
- Scilit
- SHERPA/RoMEO
- Standard Periodical Directory
- The Index of Information Systems Journals
- The Keepers Registry
- UCR Library
- Universe Digital Library
- WJCI Report
- WorldCat
Contact
- Chris LeeEditorial Assistant
- cis@ccsenet.org