Funding: Supported by the National Natural Science Foundation of China (No. 60821001) and the National Basic Research Program of China (No. 2007CB311203).
Abstract: Recently, the Internet of Things (IoT) has attracted increasing attention. Multimedia sensor networks play an important role in the IoT, and audio event detection in these networks is one of its most important applications. In practice, it is hard to collect enough real-world samples to train classifiers for some special audio events (e.g., car crashes in a smart traffic system). In this paper, we introduce a TrAdaBoost-based method to solve this problem. With the proposed approach, a strong classifier can be trained from only a tiny amount of real-world data together with a large number of more easily collected samples (e.g., from TV programs), even when the real-world data alone is insufficient to train a model. We deploy this approach in a smart traffic system to evaluate its performance, and the experiments demonstrate that our method achieves satisfactory results.
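The abstract does not spell out the training loop, so the following is only a minimal sketch of the sample-reweighting step of the standard TrAdaBoost algorithm, not the authors' implementation. The function name `tradaboost_update`, the ordering convention (easily collected "source" samples first, real-world "target" samples last), and the clamping of the error rate are assumptions made for illustration.

```python
import math

def tradaboost_update(weights, errors, n_source, n_rounds):
    """One TrAdaBoost-style reweighting round (illustrative sketch).

    weights  : current sample weights, source samples first, then target samples
    errors   : per-sample |h(x_i) - y_i| in [0, 1] for the current weak learner
    n_source : number of source (easily collected) samples at the front
    n_rounds : total number of boosting rounds planned
    """
    if not 0 < n_source < len(weights):
        raise ValueError("need at least one source and one target sample")

    # Weighted error is measured on the target (real-world) samples only.
    target_w = weights[n_source:]
    target_e = errors[n_source:]
    eps = sum(w * e for w, e in zip(target_w, target_e)) / sum(target_w)

    # Assumption: clamp eps into (0, 0.5) so that beta_t stays well defined.
    eps = min(max(eps, 1e-10), 0.499)
    beta_t = eps / (1.0 - eps)

    # Fixed shrink factor for the source samples.
    beta = 1.0 / (1.0 + math.sqrt(2.0 * math.log(n_source) / n_rounds))

    updated = []
    for i, (w, e) in enumerate(zip(weights, errors)):
        if i < n_source:
            # Misclassified source samples lose weight: they look unlike the target.
            updated.append(w * beta ** e)
        else:
            # Misclassified target samples gain weight, as in AdaBoost.
            updated.append(w * beta_t ** (-e))

    total = sum(updated)
    return [w / total for w in updated], beta_t
```

Calling this once per boosting round, after fitting a weak learner on the weighted union of both sample sets, gradually shifts weight away from source samples that disagree with the real-world data.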
Funding: Supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF2015R1D1A1A01059804); by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2016-R2718-16-0011) supervised by the IITP (Institute for Information & communications Technology Promotion); and by the Research Grant of Kwangwoon University in 2017.
Abstract: In this paper, we present an approach to improve the accuracy of environmental sound event detection in a wireless acoustic sensor network for home monitoring. Wireless acoustic sensor nodes capture sounds in the home and simultaneously deliver them to a sink node for sound event detection. The proposed approach consists of three modules: signal estimation, reliable sensor channel selection, and sound event detection. During signal estimation, lost packets are recovered to improve the signal quality. Next, reliable channels are selected using a multi-channel cross-correlation coefficient, which improves the computational efficiency of distant sound event detection without sacrificing performance. Finally, the signals of the two selected channels are used for environmental sound event detection based on bidirectional gated recurrent neural networks with two-channel audio features. Experiments show that the proposed approach outperforms the baseline.
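The channel selection step can be illustrated with a deliberately simplified stand-in. The sketch below picks the two channels whose zero-lag normalized correlation is highest; it is not the multi-channel cross-correlation coefficient used in the paper, whose exact formulation the abstract does not give. The function names `normalized_correlation` and `select_two_channels` are hypothetical, and all channels are assumed to have equal length.

```python
import math
from itertools import combinations

def normalized_correlation(x, y):
    """Zero-lag normalized (Pearson) correlation between two equal-length signals."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant channel carries no usable correlation
    return cov / math.sqrt(var_x * var_y)

def select_two_channels(channels):
    """Return the index pair of the two most mutually correlated channels."""
    pairs = combinations(range(len(channels)), 2)
    return max(
        pairs,
        key=lambda p: abs(normalized_correlation(channels[p[0]], channels[p[1]])),
    )
```

The intuition is that nodes that captured the same sound event cleanly should agree with each other, while a node dominated by noise or packet loss correlates poorly with the rest. A practical system would also need to account for the propagation delay between nodes, which this zero-lag version ignores.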
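Abstract: First, this paper introduces the design and application of an Electronic Field Production (EFP) system for live broadcasts of medium- and large-scale sporting events, enabled by Internet technology. Then, taking the live broadcast of the "皖美山水" cycling race by Anhui Radio and Television Station as a case study, it discusses how, given a high-quality network environment, a safe, reliable, and practical EFP system can be built in place of costly outside-broadcast vans and satellite vehicles and linked with the studio to complete the live broadcast successfully.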
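Abstract: Highlight event detection in soccer video has long been a hot and difficult topic in video semantic analysis. Exploiting the strength of the hidden conditional random field (HCRF) model in representing and recognizing semantic events, this paper proposes a framework based on multi-dimensional semantic clues and HCRF for detecting corner-kick, penalty-kick, and red/yellow-card highlight events. First, by analyzing the structural semantics of highlight-event video, 10 multi-dimensional semantic clues are defined to accurately describe the semantic information that highlight events carry. Then, video clips are segmented into physical shots, the multi-dimensional semantic clues are extracted from the key frames of each shot to obtain feature vectors, and the feature vectors of all shots in a test video clip together form an observation sequence. Finally, with only a small number of training samples, the observation sequences are used as input to the HCRF model, and an HCRF model for highlight event detection is built. Based on the mapping among low-level audio-visual features, multi-dimensional semantic clues, and highlight semantic events, the paper mines the inherent regularities of highlight events along multiple dimensions of video structural semantics and detects highlight events accurately. Experimental results demonstrate the effectiveness of the framework.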