期刊文献+
共找到93篇文章
< 1 2 5 >
每页显示 20 50 100
The Grammatical Categorization of Mandarin在/zài:Spatiality,Temporality and Semantic Construal
1
作者 LIU Xing 《Journal of Literature and Art Studies》 2023年第4期295-303,共9页
Mandarin在(pinyin:zài)is the most frequently used character in representing spatial and temporal relationship.Current studies mostly focus on its lexical meaning and syntactic structure while cognitive features o... Mandarin在(pinyin:zài)is the most frequently used character in representing spatial and temporal relationship.Current studies mostly focus on its lexical meaning and syntactic structure while cognitive features of its grammatical categories have been neglected.This paper investigates into the categorization of zài by conducting a morphosyntactic test among College English majors in China.The results show that:prototypes are organizing the grammatical categories of zài at all levels in terms of intra-categorial gradience;the semantic construal of zài construction could significantly influence the accuracy of the grammatical categorization of zài;the syntactic structure can provide viable cue for the identification of grammatical categories of zài;spatiality,temporality and the status of existing are three essential semantic features encoded by zài,the concurrence of which leads to various degree of inter-categorial vagueness,indicating a conflict between the rigid grammatical classification and the indeterminate nature of the grammatical functions of zai,suggesting the necessity to reconsider the efficacy of applying indiscriminately the Anglo-Saxon grammar into the study of Chinese spatial-temporal constructions. 展开更多
关键词 Mandarin zài grammatical categorization TEMPORALITY SPATIALITY semantic construal
下载PDF
A Study on Second Language Vocabulary Acquisition Under the Categorization Theory
2
作者 Ting Xiao 《Journal of Contemporary Educational Research》 2023年第12期142-150,共9页
In cognitive linguistics,debates on the status and functions of categorization have been a heated issue.In semantics and second language acquisition,scholars have discussed and achieved vocabulary acquisition from dif... In cognitive linguistics,debates on the status and functions of categorization have been a heated issue.In semantics and second language acquisition,scholars have discussed and achieved vocabulary acquisition from different perspectives and academic levels.Vocabulary learning exerts a fundamental role in second language vocabulary acquisition(SLVA),and it is closely related to learners’cognitive competence.However,studies on second language vocabulary acquisition under the categorization theory in cognitive linguistics have received less attention from linguists when compared with other studies.This paper employs two representative dimensions,the basic-level effect and the prototype effect,under the categorization theory to further delve into the implications on second language vocabulary acquisition.This article first provides a comprehensive introduction to the nature and the approaches of the categorization theory,and then analyzes the relations and implications for second language vocabulary acquisition under the categorization theory from the perspective of the basic-level and the prototype effects.The research results showed that the basic-level effect on SLVA is mainly on the classification of word categories distinguished from the superordinate and subordinate categories,while the prototype effect is more on understanding the complexity and use of word meaning. 展开更多
关键词 categorization Second language vocabulary acquisition Basic-level PROTOTYPE
下载PDF
Research of Web Pages Categorization
3
作者 Zhongda Lin Kun Deng Yanfen Hong 《南昌工程学院学报》 CAS 2006年第2期107-111,共5页
In this paper, we discuss several issues related to automated classification of web pages, especially text classification of web pages. We analyze features selection and categorization algorithms of web pages and give... In this paper, we discuss several issues related to automated classification of web pages, especially text classification of web pages. We analyze features selection and categorization algorithms of web pages and give some suggestions for web pages categorization. 展开更多
关键词 web pages categorization VSM categorization algorithms feature selection
下载PDF
A New Approach of Feature Selection for Text Categorization 被引量:6
4
作者 CUI Zifeng XU Baowen +1 位作者 ZHANG Weifeng XU Junling 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1335-1339,共5页
This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of e... This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of each other, widely used in the probabilistic models for text categorization (TC), is discussed. However, the basic hypothesis is incom plete for independence of feature set. From the view of feature selection, a new independent measure between features is designed, by which a feature selection algorithm is given to ob rain a feature subset. The selected subset is high in relevance with category and strong in independence between features, satisfies the basic hypothesis at maximum degree. Compared with other traditional feature selection method in TC (which is only taken into the relevance account), the performance of feature subset selected by our method is prior to others with experiments on the benchmark dataset of 20 Newsgroups. 展开更多
关键词 feature selection independency CHI square test text categorization
下载PDF
Comparison of Text Categorization Algorithms 被引量:4
5
作者 SHIYong-feng ZHAOYan-ping 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期798-804,共7页
This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classi... This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classifying algorithms in different fields. Finally some evaluations and summaries of these algorithms are discussed, and directions to further research have been pointed out. Key words text categorization - naive bayes - KNN - SVM - neural network CLC number TP 391 Foundation item: Supported by the National Natural Science Foundation of China (70031010) and the Research Foundation of Beijing Institute of TechnologyBiography: SHI Yong-feng (1980-), male, Master candidate, research direction: web information mining. 展开更多
关键词 text categorization naive bayes KNN SVM neural network
下载PDF
Lazy learner text categorization algorithm based on embedded feature selection 被引量:1
6
作者 Yan Peng Zheng Xuefeng +1 位作者 Zhu Jianyong Xiao Yunhong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第3期651-659,共9页
To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although havin... To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although having been widely used, FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms. On the basis of the sparsity characteristic of text vectors, a new TC algorithm based on lazy feature selection (LFS) is presented. As a new type of embedded feature selection approach, the LFS method can greatly reduce the dimension of features without any information losing, which can improve both efficiency and performance of algorithms greatly. The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms. 展开更多
关键词 machine learning text categorization embedded feature selection lazy learner cosine similarity.
下载PDF
A Text Categorization System with Soft Real-Time Guarantee 被引量:1
7
作者 WANG Hua-yong CHEN Yu DAI Yi-qi 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期226-229,共4页
In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established... In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4 of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computa tion resources. The proposed methodology enables the theo retical analysis and evaluation to the TC systems, leading to a high-quality and low cost implementation approach. 展开更多
关键词 information retrieval text categorization soft real-time system feedback control theory
下载PDF
Emotional categorization: Individual emotional differences and laterality effects in healthy and persons with multiple sclerosis
8
作者 Farzaneh Pahlavan Tudd Lubart +5 位作者 Michelle Montreuil Stephen Jacob Anne-Yves Jacquet Christelle Lemoine Hélène Petropoulou Franck Zenasni 《Health》 2010年第12期1349-1365,共17页
A study examining affective information processing in persons with Multiple Sclerosis and healthy adults was carried out. It was hypothesized that individual characteristics could modulate participants’ emotional cat... A study examining affective information processing in persons with Multiple Sclerosis and healthy adults was carried out. It was hypothesized that individual characteristics could modulate participants’ emotional categorization and reaction times for categorization decisions. For example, individuals with negative valenced emotional profile (e.g. anxious) should choose negative emotional alternatives faster and more frequently. Participants consisted of two different populations: 80 right-handed healthy French-speakers, and 40 right-handed French- speakers with multiple sclerosis. The results showed a positive correlation between high- level of negative emotional sensibility and emotional categorization (decision and decision speed) for affective information presented on the right-side of the screen. For all participants there were more frequent emotional choices and faster decisions for left-side presented emotional alternatives. It seems individuals’ emotional differences in general and in MS populations modulate hemispheric asymmetry of processing emotional judgments. 展开更多
关键词 Emotion categorization HEMISPHERIC Asymmetry Individual EMOTIONAL DIFFERENCES Multiple SCLEROSIS
下载PDF
Categorization and Reorientation of Images Based on Low Level Features
9
作者 Rajen Bhatt Gaurav Sharma +2 位作者 Abhinav Dhall Naresh Kumar Santanu Chaudhury 《Journal of Intelligent Learning Systems and Applications》 2011年第1期1-10,共10页
A hierarchical system to perform automatic categorization and reorientation of images using content analysis is pre-sented. The proposed system first categorizes images to some a priori defined categories using rotati... A hierarchical system to perform automatic categorization and reorientation of images using content analysis is pre-sented. The proposed system first categorizes images to some a priori defined categories using rotation invariant features. At the second stage, it detects their correct orientation out of {0o, 90o, 180o, and 270o} using category specific model. The system has been specially designed for embedded devices applications using only low level color and edge features. Machine learning algorithms optimized to suit the embedded implementation like support vector machines (SVMs) and scalable boosting have been used to develop classifiers for categorization and orientation detection. Results are presented on a collection of about 7000 consumer images collected from open resources. The proposed system finds it applications to various digital media products and brings pattern recognition solutions to the consumer electronics domain. 展开更多
关键词 categorization Digital Content Management Feature Selection ORIENTATION Detection SCALABLE BOOSTING Support VECTOR Machines
下载PDF
Positive-Negative Emotional Categorization of Clothing Color Based on Brightness
10
作者 Xiao-Feng Jiang Xiao-Pei Bian 《Engineering(科研)》 2013年第10期189-194,共6页
In current study, behavioral measures were conducted to investigate clothing color. The purpose was to focus on the rule that color brightness influencedpositive-negative emotional categorization. Results showed that ... In current study, behavioral measures were conducted to investigate clothing color. The purpose was to focus on the rule that color brightness influencedpositive-negative emotional categorization. Results showed that the effect of brightness on clothing color emotion categorization was significant. With the increase of brightness, the variation curve of positive emotion appears to be a “U-shaped”, whereas that of the negative emotion shows an upside down “U-shaped”. Compared with the low brightness colors, the emotion reaction to the high brightness colors was more positive;Most of the colors with different brightness scales were classified as positive emotions and the minors were classified as negative emotions;the positive colors could be done much faster than the negative ones. 展开更多
关键词 COLOR EMOTION Response Time BRIGHTNESS categorization Positive-Negative
下载PDF
The Role of Rare Terms in Enhancing the Performance of Polynomial Networks Based Text Categorization
11
作者 Mayy M. Al-Tahrawi 《Journal of Intelligent Learning Systems and Applications》 2013年第2期84-89,共6页
In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy... In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy of PNs-based text categorization, different term reduction criteria as well as different term weighting schemes were experimented on the Reuters Corpus using PNs. Each term weighting scheme on each reduced term set was tested once keeping the rare terms and another time removing them. All the experiments conducted in this research show that keeping rare terms substantially improves the performance of Polynomial Networks in Text Categorization, regardless of the term reduction method, the number of terms used in classification, or the term weighting scheme adopted. 展开更多
关键词 POLYNOMIAL NETWORKS TEXT categorization Document Classification Infrequent TERMS RARE TERMS
下载PDF
Survey on Clustering Techniques for Image Categorization Dataset
12
作者 Mohd Afizi Mohd Shukran Mohd Sidek Fadhil Mohd Yunus +5 位作者 Muhammad Naim Abdullah Mohd Rizal Mohd Isa Mohammad Adib Khairuddin Kamaruzaman Maskat Suhaila Ismail Abdul Samad Shibghatullah 《Journal of Computer and Communications》 2022年第6期177-185,共9页
Content Based Image Retrieval, CBIR, performed an automated classification task for a queried image. It could relieve a user from the laborious and time-consuming metadata assigning for an image while working on massi... Content Based Image Retrieval, CBIR, performed an automated classification task for a queried image. It could relieve a user from the laborious and time-consuming metadata assigning for an image while working on massive image collection. For an image, user’s definition or description is subjective where it could belong to different categories as defined by different users. Human based categorization and computer-based categorization might produce different results due to different categorization criteria that rely on dataset structure and the clustering techniques. This paper is aimed to exhibit an idea for planning the dataset structure and choosing the clustering algorithm for CBIR implementation. There are 5 sections arranged in this paper;CBIR and QBE concepts are introduced in Section 1, related image categorization research is listed in Section 2, the 5 type of image clustering are described in Section 3, comparative analysis in Section 4, and Section 5 conclude this study. Outcome of this paper will be benefiting CBIR developer for various applications. 展开更多
关键词 categorization CBIR CLASSIFICATIONS CLUSTERING DATASET
下载PDF
Comparative Study of Chinese-western Cultural Context's Influence on Metaphorical Categorization of “heart” in Original and English Version of Caigentan
13
作者 魏靖 《海外英语》 2017年第7期218-220,共3页
The study of metaphor of"xin"or heart could be traced back to the beginning of 21 stcentury,and a terrific text for the study isthe famous works called Caigentan,whose distinguished English version is transl... The study of metaphor of"xin"or heart could be traced back to the beginning of 21 stcentury,and a terrific text for the study isthe famous works called Caigentan,whose distinguished English version is translated by Paul Whiter.In this paper the metaphors of"xin"in both Chinese and English version from the categorization process are to be analyzed especially their vehicle category and themapping process and thus some similarities and differences could be spotted,and finally the Chinese and western cultural context be-hind them could be deduced and analyzed. 展开更多
关键词 metaphor of "xin" categorization process similarities and differences cultural context
下载PDF
An up -to -date comparative analysis of the KNN classifier distance metrics for text categorization
14
作者 Onder Coban 《Data Science and Informetrics》 2023年第2期67-78,共12页
Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by ... Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights. 展开更多
关键词 Text categorization k-nearest neighbor distance metric distance learning algorithms
原文传递
Smart Approaches to Efficient Text Mining for Categorizing Sexual Reproductive Health Short Messages into Key Themes
15
作者 Tobias Makai Mayumbo Nyirenda 《Open Journal of Applied Sciences》 2024年第2期511-532,共22页
To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved a... To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved access to information on various Sexual Reproductive Health topics through Short Messaging Service (SMS) messages. Over the years, the platform has accumulated millions of incoming and outgoing messages, which need to be categorized into key thematic areas for better tracking of sexual reproductive health knowledge gaps among young people. The current manual categorization process of these text messages is inefficient and time-consuming and this study aims to automate the process for improved analysis using text-mining techniques. Firstly, the study investigates the current text message categorization process and identifies a list of categories adopted by counselors over time which are then used to build and train a categorization model. Secondly, the study presents a proof of concept tool that automates the categorization of U-report messages into key thematic areas using the developed categorization model. Finally, it compares the performance and effectiveness of the developed proof of concept tool against the manual system. The study used a dataset comprising 206,625 text messages. The current process would take roughly 2.82 years to categorise this dataset whereas the trained SVM model would require only 6.4 minutes while achieving an accuracy of 70.4% demonstrating that the automated method is significantly faster, more scalable, and consistent when compared to the current manual categorization. These advantages make the SVM model a more efficient and effective tool for categorizing large unstructured text datasets. These results and the proof-of-concept tool developed demonstrate the potential for enhancing the efficiency and accuracy of message categorization on the Zambia U-report platform and other similar text messages-based platforms. 展开更多
关键词 Knowledge Discovery in Text (KDT) Sexual Reproductive Health (SRH) Text categorization Text Classification Text Extraction Text Mining Feature Extraction Automated Classification Process Performance Stemming and Lemmatization Natural Language Processing (NLP)
下载PDF
Ableism and (Neo)Racism in School Placement Processes in Quebec: School Personnel Interpretations of Immigrant Student Difficulties - A Secondary Publication
16
作者 Tya Collins Corina Borri-Anadon 《Journal of Contemporary Educational Research》 2024年第3期148-160,共13页
The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the sc... The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the school experiences of students from immigrant backgrounds or racialized groups.The research problem of this article concerns the identification of these students as disabled or as having adjustment or learning difficulties.From a perspective anchored in Disability Critical Race Studies,this ethnographic study documents different interpretations of perceived difficulties made by school actors with regard to seven primary school students from immigrant backgrounds.Five interpretation types are presented:(1)medicalization by dismissal of cultural markers,(2)medicalization by professional constraint,(3)medicalization by cultural deficit,(4)precautionary wait,and(5)cultural differentialism.Our results help to shed light on the special education overrepresentation phenomenon regarding these students and to understand how ableism and(neo)racism contribute to it. 展开更多
关键词 categorization in education Learning difficulties and students in difficulty Immigration and ethnicity Educational inclusion and exclusion Canada
下载PDF
Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization 被引量:3
17
作者 Min DU Xing-shu CHEN 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2013年第6期407-416,共10页
Text categorization is a significant technique to manage the surging text data on the Internet.The k-nearest neighbors(kNN) algorithm is an effective,but not efficient,classification model for text categorization.In t... Text categorization is a significant technique to manage the surging text data on the Internet.The k-nearest neighbors(kNN) algorithm is an effective,but not efficient,classification model for text categorization.In this paper,we propose an effective strategy to accelerate the standard kNN,based on a simple principle:usually,near points in space are also near when they are projected into a direction,which means that distant points in the projection direction are also distant in the original space.Using the proposed strategy,most of the irrelevant points can be removed when searching for the k-nearest neighbors of a query point,which greatly decreases the computation cost.Experimental results show that the proposed strategy greatly improves the time performance of the standard kNN,with little degradation in accuracy.Specifically,it is superior in applications that have large and high-dimensional datasets. 展开更多
关键词 k-nearest neighbors(kNN) TEXT categorization Accelerating strategy Principal COMPONENT analysis(PCA)
原文传递
Non-Independent Term Selection for Chinese Text Categorization 被引量:2
18
作者 李景阳 孙茂松 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第1期113-120,共8页
Chinese text categorization differs from English text categorization due to its much larger term set (of words or character n-grams), which results in very slow training and working of modern high-performance classi... Chinese text categorization differs from English text categorization due to its much larger term set (of words or character n-grams), which results in very slow training and working of modern high-performance classifiers. This study assumes that this high-dimensionality problem is related to the redundancy in the term set, which cannot be solved by traditional term selection methods. A greedy algorithm framework named "non-independent term selection" is presented, which reduces the redundancy according to string-level correlations. Several preliminary implementations of this idea are demonstrated. Experiment results show that a good tradeoff can be reached between the performance and the size of the term set. 展开更多
关键词 Chinese text categorization term selection dimentionality
原文传递
A Two-Stage Feature Selection Method for Text Categorization by Using Category Correlation Degree and Latent Semantic Indexing 被引量:2
19
作者 王飞 李彩虹 +2 位作者 王景山 徐娇 李廉 《Journal of Shanghai Jiaotong university(Science)》 EI 2015年第1期44-50,共7页
With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(C... With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(CCD)method and latent semantic indexing(LSI).In the first stage,a novel CCD method is proposed to select the most effective features for text classification,which is more effective than the traditional feature selection method.In the second stage,document representation requires a high dimensionality of the feature space and does not take into account the semantic relation between features,which leads to a poor categorization accuracy.So LSI method is proposed to solve these problems by using statistically derived conceptual indices to replace the individual terms which can discover the important correlative relationship between features and reduce the feature space dimension.Firstly,each feature in our algorithm is ranked depending on their importance of classification using CCD method.Secondly,we construct a new semantic space based on LSI method among features.The experimental results have proved that our method can reduce effectively the dimension of text vector and improve the performance of text categorization. 展开更多
关键词 text categorization feature selection latent semantic indexing(LSI) category correlation degree(CCD)
原文传递
Associative categorization of frequent patterns based on the probabilistic graphical model 被引量:1
20
作者 Weiyi LIU Kun YUE +3 位作者 Hui LIU Ping ZHANG Suiye LIU Qianyi WANG 《Frontiers of Computer Science》 SCIE EI CSCD 2014年第2期265-278,共14页
Discovering the hierarchical structures of differ- ent classes of object behaviors can satisfy the requirements of various degrees of abstraction in association analysis, be- havior modeling, data preprocessing, patte... Discovering the hierarchical structures of differ- ent classes of object behaviors can satisfy the requirements of various degrees of abstraction in association analysis, be- havior modeling, data preprocessing, pattern recognition and decision making, etc. In this paper, we call this process as associative categorization, which is different from classical clustering, associative classification and associative cluster- ing. Focusing on representing the associations of behaviors and the corresponding uncertainties, we propose the method for constructing a Markov network (MN) from the results of frequent pattern mining, called item-associative Markov net- work (IAMN), where nodes and edges represent the frequent patterns and their associations respectively. We further dis- cuss the properties of a probabilistic graphical model to guar- antee the IAMN's correctness theoretically. Then, we adopt the concept of chordal to reflect the closeness of nodes in the IAMN. Adopting the algorithm for constructing join trees from an MN, we give the algorithm for IAMN-based associa- tive categorization by hierarchical bottom-up aggregations of nodes. Experimental results show the effectiveness, efficiency and correctness of our methods. 展开更多
关键词 frequent pattem behavior association associa-tive categorization Markov network hierarchical aggrega-tion
原文传递
上一页 1 2 5 下一页 到第
使用帮助 返回顶部