Purpose-When a large number of project proposals are evaluated to alocate available funds,grouping them based on their simiarites is benefciaL.Current approaches to group proposals are primarily based on manual matchi...Purpose-When a large number of project proposals are evaluated to alocate available funds,grouping them based on their simiarites is benefciaL.Current approaches to group proposals are primarily based on manual matching of similar topics,discipline areas and keywordls declared by project applicants.When the number of proposals increases,this task becomes complex and requires excessive time.This paper aims to demonstrate how to ffctively use the rich information in the titles and abstracts of Turkish project propsals to group them atmaially.Design/methodology/approach-This study proposes a model that effectively groups Turkish project proposals by combining word embedding,clustering and classification technigues.The proposed model uses FastText,BERT and term frequency/inverse document frequency(TF/IDF)word-embedding techniques to extract terms from the titles and abstracts of project proposals in Turkish.The extracted terms were grouped using both the clustering and classification techniques.Natural groups contained within the corpus were discovered using k-means,k-means++,k-medoids and agglomerative clustering algorithms,Additionally,this study employs classification approaches to predict the target class for each document in the corpus.To classify project proposals,var ious classifiers,including k nearest neighbors(KNN),support vector machines(SVM),artificial neural networks(ANN),cassftcation and regression trees(CART)and random forest(RF),are used.Empirical experiments were conducted to validate the effectiveness of the proposed method by using real data from the Istanbul Development Agency.Findings-The results show that the generated word embeddings an fftvely represent proposal texts as vectors,and can be used as inputs for dustering or casificatiomn algorithms.Using clustering algorithms,the document corpus is divided into five groups.In adition,the results demonstrate that the proposals can easily be categoried into predefmned categories using cassifiation algorithms.SVM-Linear achieved the highest prediction accuracy(89.2%)with the FastText word embedding.method.A comparison of mamual grouping with automatic casification and clutering results revealed that both classification and custering techniques have a high sucess rate.Research limitations/implications-The propsed mdelatomatically benefits fromthe rich information in project proposals and significantly reduces numerous time consuming tasks that managers must perform manually.Thus,it eliminates the drawbacks of the curent manual methods and yields significantly more acurate results.In the future,additional experiments should be conducted to validate the proposed method using data from other funding organizations.Originality/value-This study presents the application of word embedding methods to eftively use the rich information in the titles and abstracts of Turkish project proposals.Existing research studies focus on the automatice grouping of proposals;traditional frequency-based word embedding methods are used for feature extraction methods to represent project proposals.Unlike previous research,this study employs two outperforming neural network-based textual feature extraction techniques to obtain termns representing the proposals:BERT as a contextual word embedding method and F astText as a static word embedding method.Moreover,to the best of our knowledge,there has been no research conducted on the grouping of project proposals in Turkish.展开更多
It is challenging to track a target continuously in videos with long-term occlusion,or objects which leave then re-enter a scene.Existing tracking algorithms combined with onlinetrained object detectors perform unreli...It is challenging to track a target continuously in videos with long-term occlusion,or objects which leave then re-enter a scene.Existing tracking algorithms combined with onlinetrained object detectors perform unreliably in complex conditions, and can only provide discontinuous trajectories with jumps in position when the object is occluded. This paper proposes a novel framework of tracking-by-detection using selection and completion to solve the abovementioned problems. It has two components, tracking and trajectory completion. An offline-trained object detector can localize objects in the same category as the object being tracked. The object detector is based on a highly accurate deep learning model. The object selector determines which object should be used to re-initialize a traditional tracker. As the object selector is trained online,it allows the framework to be adaptable. During completion, a predictive non-linear autoregressive neural network completes any discontinuous trajectory.The tracking component is an online real-time algorithm, and the completion part is an after-theevent mechanism. Quantitative experiments show a significant improvement in robustness over prior stateof-the-art methods.展开更多
文摘Purpose-When a large number of project proposals are evaluated to alocate available funds,grouping them based on their simiarites is benefciaL.Current approaches to group proposals are primarily based on manual matching of similar topics,discipline areas and keywordls declared by project applicants.When the number of proposals increases,this task becomes complex and requires excessive time.This paper aims to demonstrate how to ffctively use the rich information in the titles and abstracts of Turkish project propsals to group them atmaially.Design/methodology/approach-This study proposes a model that effectively groups Turkish project proposals by combining word embedding,clustering and classification technigues.The proposed model uses FastText,BERT and term frequency/inverse document frequency(TF/IDF)word-embedding techniques to extract terms from the titles and abstracts of project proposals in Turkish.The extracted terms were grouped using both the clustering and classification techniques.Natural groups contained within the corpus were discovered using k-means,k-means++,k-medoids and agglomerative clustering algorithms,Additionally,this study employs classification approaches to predict the target class for each document in the corpus.To classify project proposals,var ious classifiers,including k nearest neighbors(KNN),support vector machines(SVM),artificial neural networks(ANN),cassftcation and regression trees(CART)and random forest(RF),are used.Empirical experiments were conducted to validate the effectiveness of the proposed method by using real data from the Istanbul Development Agency.Findings-The results show that the generated word embeddings an fftvely represent proposal texts as vectors,and can be used as inputs for dustering or casificatiomn algorithms.Using clustering algorithms,the document corpus is divided into five groups.In adition,the results demonstrate that the proposals can easily be categoried into predefmned categories using cassifiation algorithms.SVM-Linear achieved the highest prediction accuracy(89.2%)with the FastText word embedding.method.A comparison of mamual grouping with automatic casification and clutering results revealed that both classification and custering techniques have a high sucess rate.Research limitations/implications-The propsed mdelatomatically benefits fromthe rich information in project proposals and significantly reduces numerous time consuming tasks that managers must perform manually.Thus,it eliminates the drawbacks of the curent manual methods and yields significantly more acurate results.In the future,additional experiments should be conducted to validate the proposed method using data from other funding organizations.Originality/value-This study presents the application of word embedding methods to eftively use the rich information in the titles and abstracts of Turkish project proposals.Existing research studies focus on the automatice grouping of proposals;traditional frequency-based word embedding methods are used for feature extraction methods to represent project proposals.Unlike previous research,this study employs two outperforming neural network-based textual feature extraction techniques to obtain termns representing the proposals:BERT as a contextual word embedding method and F astText as a static word embedding method.Moreover,to the best of our knowledge,there has been no research conducted on the grouping of project proposals in Turkish.
基金supported by the National Natural Science Foundation of China (Project No. 61521002)the General Financial Grant from the China Postdoctoral Science Foundation (Grant No. 2015M580100)a Research Grant of Beijing Higher Institution Engineering Research Center, and an EPSRC Travel Grant
文摘It is challenging to track a target continuously in videos with long-term occlusion,or objects which leave then re-enter a scene.Existing tracking algorithms combined with onlinetrained object detectors perform unreliably in complex conditions, and can only provide discontinuous trajectories with jumps in position when the object is occluded. This paper proposes a novel framework of tracking-by-detection using selection and completion to solve the abovementioned problems. It has two components, tracking and trajectory completion. An offline-trained object detector can localize objects in the same category as the object being tracked. The object detector is based on a highly accurate deep learning model. The object selector determines which object should be used to re-initialize a traditional tracker. As the object selector is trained online,it allows the framework to be adaptable. During completion, a predictive non-linear autoregressive neural network completes any discontinuous trajectory.The tracking component is an online real-time algorithm, and the completion part is an after-theevent mechanism. Quantitative experiments show a significant improvement in robustness over prior stateof-the-art methods.