期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
Tjong:A transformer‐based Mahjong AI via hierarchical decision‐making and fan backward
1
作者 xiali li Bo liu +2 位作者 Zhi Wei Zhaoqi Wang licheng Wu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第4期982-995,共14页
Mahjong,a complex game with hidden information and sparse rewards,poses significant challenges.Existing Mahjong AIs require substantial hardware resources and extensive datasets to enhance AI capabilities.The authors ... Mahjong,a complex game with hidden information and sparse rewards,poses significant challenges.Existing Mahjong AIs require substantial hardware resources and extensive datasets to enhance AI capabilities.The authors propose a transformer‐based Mahjong AI(Tjong)via hierarchical decision‐making.By utilising self‐attention mechanisms,Tjong effectively captures tile patterns and game dynamics,and it decouples the decision pro-cess into two distinct stages:action decision and tile decision.This design reduces de-cision complexity considerably.Additionally,a fan backward technique is proposed to address the sparse rewards by allocating reversed rewards for actions based on winning hands.Tjong consists of 15M parameters and is trained using approximately 0.5 M data over 7 days of supervised learning on a single server with 2 GPUs.The action decision achieved an accuracy of 94.63%,while the claim decision attained 98.55%and the discard decision reached 81.51%.In a tournament format,Tjong outperformed AIs(CNN,MLP,RNN,ResNet,VIT),achieving scores up to 230%higher than its opponents.Further-more,after 3 days of reinforcement learning training,it ranked within the top 1%on the leaderboard on the Botzone platform. 展开更多
关键词 decision making deep learning deep neural networks
下载PDF
TibetanGoTinyNet:a lightweight U-Net style network for zero learning of Tibetan Go
2
作者 xiali li Yanyin ZHANG +2 位作者 licheng WU Yandong CHEN Junzhi YU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2024年第7期924-937,共14页
The game of Tibetan Go faces the scarcity of expert knowledge and research literature.Therefore,we study the zero learning model of Tibetan Go under limited computing power resources and propose a novel scaleinvariant... The game of Tibetan Go faces the scarcity of expert knowledge and research literature.Therefore,we study the zero learning model of Tibetan Go under limited computing power resources and propose a novel scaleinvariant U-Net style two-headed output lightweight network TibetanGoTinyNet.The lightweight convolutional neural networks and capsule structure are applied to the encoder and decoder of TibetanGoTinyNet to reduce computational burden and achieve better feature extraction results.Several autonomous self-attention mechanisms are integrated into TibetanGoTinyNet to capture the Tibetan Go board’s spatial and global information and select important channels.The training data are generated entirely from self-play games.TibetanGoTinyNet achieves 62%–78%winning rate against other four U-Net style models including Res-UNet,Res-UNet Attention,Ghost-UNet,and Ghost Capsule-UNet.It also achieves 75%winning rate in the ablation experiments on the attention mechanism with embedded positional information.The model saves about 33%of the training time with 45%–50%winning rate for different Monte–Carlo tree search(MCTS)simulation counts when migrated from 9×9 to 11×11 boards.Code for our model is available at https://github.com/paulzyy/TibetanGoTinyNet. 展开更多
关键词 Zero learning Tibetan Go U-Net Self-attention mechanism Capsule network Monte-Carlo tree search
原文传递
Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode 被引量:1
3
作者 Yue Zhao Jianjian Yue +4 位作者 Wei Song Xiaona Xu xiali li licheng Wu Qiang Ji 《Journal on Internet of Things》 2019年第1期17-23,共7页
We proposed a method using latent regression Bayesian network (LRBN) toextract the shared speech feature for the input of end-to-end speech recognition model.The structure of LRBN is compact and its parameter learning... We proposed a method using latent regression Bayesian network (LRBN) toextract the shared speech feature for the input of end-to-end speech recognition model.The structure of LRBN is compact and its parameter learning is fast. Compared withConvolutional Neural Network, it has a simpler and understood structure and lessparameters to learn. Experimental results show that the advantage of hybridLRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classificationarchitecture for Tibetan multi-dialect speech recognition, and demonstrate the LRBN ishelpful to differentiate among multiple language speech sets. 展开更多
关键词 Multi-dialect speech recognition Tibetan language latent regressionbayesian network end-to-end model
下载PDF
A modified YOLOv4 detection method for a vision-based underwater garbage cleaning robot 被引量:5
4
作者 Manjun TIAN xiali li +2 位作者 Shihan KONG licheng WU Junzhi YU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2022年第8期1217-1228,共12页
To tackle the problem of aquatic environment pollution,a vision-based autonomous underwater garbage cleaning robot has been developed in our laboratory.We propose a garbage detection method based on a modified YOLOv4,... To tackle the problem of aquatic environment pollution,a vision-based autonomous underwater garbage cleaning robot has been developed in our laboratory.We propose a garbage detection method based on a modified YOLOv4,allowing high-speed and high-precision object detection.Specifically,the YOLOv4 algorithm is chosen as a basic neural network framework to perform object detection.With the purpose of further improvement on the detection accuracy,YOLOv4 is transformed into a four-scale detection method.To improve the detection speed,model pruning is applied to the new model.By virtue of the improved detection methods,the robot can collect garbage autonomously.The detection speed is up to 66.67 frames/s with a mean average precision(mAP)of 95.099%,and experimental results demonstrate that both the detection speed and the accuracy of the improved YOLOv4 are excellent. 展开更多
关键词 Object detection Aquatic environment Garbage cleaning robot Modified YOLOv4
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部