摘要
中山大学中文系神经语言学教学实验室面向语言障碍筛查开发的汉语儿童言语交际水平评估系统,以一套固定程序作为引导,能在短时间内快速采集儿童的言语数据。基于这个评估范式,实验室采集了大量2~14岁儿童言语交际过程中的言语数据,从语音、能产性、流畅度、语法、语义、逻辑六大语言维度出发,细分为16项指标对语料进行人工标注和机器识别,建立起一个应用于语言能力评估和语言障碍筛查的汉语儿童言语数据库,可以精准评估汉语儿童的言语交际水平。目前该语料库储存了966名汉语儿童的言语数据,并对638名儿童的语料进行了标注。该语料库可以对儿童语言障碍的智能化筛查提供机器学习训练数据,也可以为研究汉语儿童语言习得和各类儿童语言障碍提供数据资源支持。
Language is an indispensable communication tool for human beings,and language ability is an essential skill that children must acquire in their development.Oriented to the language disorders in Chinese-speaking children,an evaluation system has been developed by the Neurolinguistics teaching laboratory at Sun Yat-sen University to measure Chinese children’s speech communication ability and screen language-related disabilities.Using a fixed procedure as a guide,the system can collect children’s speech communication data in a very short time.Based on this evaluation paradigm,a speech corpus of Chi-nese-speaking children for language disorder screening was established,and up to now data of 996 children aged between 2-14 have been collected.The data are evaluated from six linguistic aspects(including phonology,productivity,fl uency,grammar,semantics,and logic)with 16 indicators recognized by both manual annotation and machine recognition.Currently,the data of 638 Chinese-speaking children have been processed and annotated.Such a corpus can off er an affl uent training set for automatic screening of children’s language disorders,and provide resource for studies on language acquisition and language disorders.
作者
陆烁
丘国新
钱思宇
高乐妍
Lu Shuo;Qiu Guoxin;Qian Siyu;Gao Leyan
出处
《语言战略研究》
CSSCI
北大核心
2021年第6期45-58,共14页
Chinese Journal of Language Policy and Planning
关键词
儿童语言障碍
语言评估
言语交际
数据库
语料库
Children’s language disorder
language evaluation
speech communication
data base
corpus