In machine translation(MT) practice,there is an urgent need for constructing a set of Chinese-to-English aspect transferring rules to define the transferring conditions.The integrated feature set was used to generaliz...In machine translation(MT) practice,there is an urgent need for constructing a set of Chinese-to-English aspect transferring rules to define the transferring conditions.The integrated feature set was used to generalize and justify the Chinese-to-English transferring rule of the 'ZHE' aspect(ZHE Rule).A ZHE classification model was built in this study.The impacts of each set of temporal,lexical aspectual,and syntactic features,and their integrated impacts,on the accuracy of the ZHE Rule were tested.Over 600 misclassified corpus sentences were manually examined.A 10-fold cross-validation was used with a decision tree algorithm.The main results are:(1) The ZHE Rule was generalized and justified to have a higher accuracy under the two metrics:the precision rate and the areas under the receiver operating characteristic curve(AUC).(2) The temporal,lexical aspectual,and syntactic feature sets have an integrated contribution to the accuracy of the ZHE Rule.The syntactic and temporal features have an impact on ZHE aspect derivations,while the lexical aspectual features are not predictive of ZHE aspect derivation.(3) While associated with active verbs,the ZHE aspect can denote a perfective situation.This study suggests that the temporal and syntactic features are the predictive ZHE aspect classification features and that the ZHE Rule with an overall precision rate of 80.1% is accurate enough to be further explored in MT practice.The machine learning method,decision tree,can be applied to the automatic aspect transferring in MT research and aspectual interpretations in linguistic research.展开更多
基金supported by the National Social Science Foundation of China(No.08BYY001)the Worldwide Universities Network 2009 Research Mobility Programme
文摘In machine translation(MT) practice,there is an urgent need for constructing a set of Chinese-to-English aspect transferring rules to define the transferring conditions.The integrated feature set was used to generalize and justify the Chinese-to-English transferring rule of the 'ZHE' aspect(ZHE Rule).A ZHE classification model was built in this study.The impacts of each set of temporal,lexical aspectual,and syntactic features,and their integrated impacts,on the accuracy of the ZHE Rule were tested.Over 600 misclassified corpus sentences were manually examined.A 10-fold cross-validation was used with a decision tree algorithm.The main results are:(1) The ZHE Rule was generalized and justified to have a higher accuracy under the two metrics:the precision rate and the areas under the receiver operating characteristic curve(AUC).(2) The temporal,lexical aspectual,and syntactic feature sets have an integrated contribution to the accuracy of the ZHE Rule.The syntactic and temporal features have an impact on ZHE aspect derivations,while the lexical aspectual features are not predictive of ZHE aspect derivation.(3) While associated with active verbs,the ZHE aspect can denote a perfective situation.This study suggests that the temporal and syntactic features are the predictive ZHE aspect classification features and that the ZHE Rule with an overall precision rate of 80.1% is accurate enough to be further explored in MT practice.The machine learning method,decision tree,can be applied to the automatic aspect transferring in MT research and aspectual interpretations in linguistic research.