TY - JOUR
T1 - Syntactic parsing of clause constituents for statistical machine translation
AU - Ma, Jianjun
AU - Pei, Jiahuan
AU - Huang, Degen
AU - Song, Dingxin
PY - 2018
Y1 - 2018
N2 - The clause is considered as the basic unit of grammar in linguistics, which is a structure between a chunk and a sentence. Clause constituents, therefore, are one important kind of linguistically valid syntactic phrases. This paper adopts the CRFs model to recognise English clause constituents with their syntactic functions, and testifies their effect on machine translation by applying this syntactic information to an English-Chinese PBSMT system, evaluated on a corpus of business domain. Clause constituents are mainly classified into six kinds: subject, predicate, complement, adjunct, residues of predicate, and residues of complement. Results show that our rich-feature CRFs model achieves an F-measure of 93.31%, a precision of 93.26%, and a recall of 93.04%. This syntactic knowledge in the source language is further combined with the NiuTrans phrasal SMT system, which slightly improves the English-Chinese translation accuracy.
AB - The clause is considered as the basic unit of grammar in linguistics, which is a structure between a chunk and a sentence. Clause constituents, therefore, are one important kind of linguistically valid syntactic phrases. This paper adopts the CRFs model to recognise English clause constituents with their syntactic functions, and testifies their effect on machine translation by applying this syntactic information to an English-Chinese PBSMT system, evaluated on a corpus of business domain. Clause constituents are mainly classified into six kinds: subject, predicate, complement, adjunct, residues of predicate, and residues of complement. Results show that our rich-feature CRFs model achieves an F-measure of 93.31%, a precision of 93.26%, and a recall of 93.04%. This syntactic knowledge in the source language is further combined with the NiuTrans phrasal SMT system, which slightly improves the English-Chinese translation accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85052866469&partnerID=8YFLogxK
U2 - 10.1504/ijcse.2018.094424
DO - 10.1504/ijcse.2018.094424
M3 - Article
SN - 1742-7185
VL - 17
SP - 126
EP - 132
JO - International Journal of Computational Science and Engineering
JF - International Journal of Computational Science and Engineering
IS - 1
ER -