期刊文献+
共找到93篇文章
< 1 2 5 >
每页显示 20 50 100
Ableism and (Neo)Racism in School Placement Processes in Quebec: School Personnel Interpretations of Immigrant Student Difficulties - A Secondary Publication
1
作者 Tya Collins Corina Borri-Anadon 《Journal of Contemporary Educational Research》 2024年第3期148-160,共13页
The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the sc... The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the school experiences of students from immigrant backgrounds or racialized groups.The research problem of this article concerns the identification of these students as disabled or as having adjustment or learning difficulties.From a perspective anchored in Disability Critical Race Studies,this ethnographic study documents different interpretations of perceived difficulties made by school actors with regard to seven primary school students from immigrant backgrounds.Five interpretation types are presented:(1)medicalization by dismissal of cultural markers,(2)medicalization by professional constraint,(3)medicalization by cultural deficit,(4)precautionary wait,and(5)cultural differentialism.Our results help to shed light on the special education overrepresentation phenomenon regarding these students and to understand how ableism and(neo)racism contribute to it. 展开更多
关键词 Categorization in education Learning difficulties and students in difficulty Immigration and ethnicity Educational inclusion and exclusion Canada
下载PDF
Smart Approaches to Efficient Text Mining for Categorizing Sexual Reproductive Health Short Messages into Key Themes
2
作者 Tobias Makai Mayumbo Nyirenda 《Open Journal of Applied Sciences》 2024年第2期511-532,共22页
To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved a... To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved access to information on various Sexual Reproductive Health topics through Short Messaging Service (SMS) messages. Over the years, the platform has accumulated millions of incoming and outgoing messages, which need to be categorized into key thematic areas for better tracking of sexual reproductive health knowledge gaps among young people. The current manual categorization process of these text messages is inefficient and time-consuming and this study aims to automate the process for improved analysis using text-mining techniques. Firstly, the study investigates the current text message categorization process and identifies a list of categories adopted by counselors over time which are then used to build and train a categorization model. Secondly, the study presents a proof of concept tool that automates the categorization of U-report messages into key thematic areas using the developed categorization model. Finally, it compares the performance and effectiveness of the developed proof of concept tool against the manual system. The study used a dataset comprising 206,625 text messages. The current process would take roughly 2.82 years to categorise this dataset whereas the trained SVM model would require only 6.4 minutes while achieving an accuracy of 70.4% demonstrating that the automated method is significantly faster, more scalable, and consistent when compared to the current manual categorization. These advantages make the SVM model a more efficient and effective tool for categorizing large unstructured text datasets. These results and the proof-of-concept tool developed demonstrate the potential for enhancing the efficiency and accuracy of message categorization on the Zambia U-report platform and other similar text messages-based platforms. 展开更多
关键词 Knowledge Discovery in Text (KDT) Sexual Reproductive Health (SRH) Text Categorization Text Classification Text Extraction Text Mining Feature Extraction Automated Classification Process Performance Stemming and Lemmatization Natural Language Processing (NLP)
下载PDF
A COMPARISON OF ALTERNATIVE CRITERIA FOR DEFINING FUZZY BOUNDARIES ON FUZZY CATEGORICAL MAPS 被引量:1
3
作者 ZHANG Jingxiong Roger P.Kirby 《Geo-Spatial Information Science》 2000年第2期26-34,共9页
This paper provides a brief introduction to the methods for generating fuzzy categorical maps from remotely sensed images (in graphical and digital forms).This is followed by a description of the slicing process for d... This paper provides a brief introduction to the methods for generating fuzzy categorical maps from remotely sensed images (in graphical and digital forms).This is followed by a description of the slicing process for deriving fuzzy boundaries from fuzzy categorical maps,which can be based on the maximum fuzzy membership values,confusion index,or measure of entropy.Results from an empirical test preformed in an Edinburgh suburb show that fuzzy boundaries of land cover can be derived from aerial photographs and satellite images by using the three criteria with small differences,and that slicing based on the maximum fuzzy membership values is the easiest and most straightforward solution.This,in turn,implies the suitability of maintaining both a crisp classification and its underlying certainty map for deriving fuzzy boundaries at different thresholds,which is a flexible and compact management of categorical map data and their uncertainty. 展开更多
关键词 categorical mapping objects FIELDS FUZZY categorical MAPS FUZZY MEMBERSHIP VALUES (FMVs) FUZZY boundaries
下载PDF
A Text Categorization System with Soft Real-Time Guarantee 被引量:1
4
作者 WANG Hua-yong CHEN Yu DAI Yi-qi 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期226-229,共4页
In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established... In order to provide predictable runtime performante for text categorization (TC) systems, an innovative system design method is proposed for soft real time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4 of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computa tion resources. The proposed methodology enables the theo retical analysis and evaluation to the TC systems, leading to a high-quality and low cost implementation approach. 展开更多
关键词 information retrieval text categorization soft real-time system feedback control theory
下载PDF
Mapping QTL for Categorical Traits with Multivariate Regression
5
作者 田佺 杨润清 《Journal of Shanghai Jiaotong university(Science)》 EI 2005年第S1期97-102,共6页
Simple linear regression analysis has been used to map QTL for quantitative traits. Many traits of biological interest and/or economical importance in various species show binary phenotypic distributions (e.g., presen... Simple linear regression analysis has been used to map QTL for quantitative traits. Many traits of biological interest and/or economical importance in various species show binary phenotypic distributions (e.g., presence or absence). It has been shown that such a binary trait also can be analyzed with the simple linear regression, subject to virtually no loss in power compared to the generalized linear model analysis. Binary trait is a special case of a multiple categorical trait (e.g., low, medium or high). We propose a mechanism to decompose a multiple categorical trait into an array of correlated binary variables. The categorical trait turned multiple binary traits are analyzed with a multivariate linear regression method. Turning the problem of categorical trait mapping into that of multivariate mapping allows the exploration of pleiotropic effects of QTL for different categories. Efficiency of the method is verified through a series of simulation experiments. 展开更多
关键词 categorical TRAIT MAPPING QTL MULTIVARIATE linear regression analysis
下载PDF
Coupled Attribute Similarity Learning on Categorical Data for Multi-Label Classification
6
作者 Zhenwu Wang Longbing Cao 《Journal of Beijing Institute of Technology》 EI CAS 2017年第3期404-410,共7页
In this paper a novel coupled attribute similarity learning method is proposed with the basis on the multi-label categorical data(CASonMLCD).The CASonMLCD method not only computes the correlations between different ... In this paper a novel coupled attribute similarity learning method is proposed with the basis on the multi-label categorical data(CASonMLCD).The CASonMLCD method not only computes the correlations between different attributes and multi-label sets using information gain,which can be regarded as the important degree of each attribute in the attribute learning method,but also further analyzes the intra-coupled and inter-coupled interactions between an attribute value pair for different attributes and multiple labels.The paper compared the CASonMLCD method with the OF distance and Jaccard similarity,which is based on the MLKNN algorithm according to 5common evaluation criteria.The experiment results demonstrated that the CASonMLCD method can mine the similarity relationship more accurately and comprehensively,it can obtain better performance than compared methods. 展开更多
关键词 COUPLED SIMILARITY MULTI-LABEL categorical data CORRELATIONS
下载PDF
A Graph Drawing Algorithm for Visualizing Multivariate Categorical Data
7
作者 HUANG Jingwei HUANG Jie 《Wuhan University Journal of Natural Sciences》 CAS 2007年第2期239-242,共4页
In this paper, a new approach for visualizing multivariate categorical data is presented. The approach uses a graph to represent multivariate categorical data and draws the graph in such a way that we can identify pat... In this paper, a new approach for visualizing multivariate categorical data is presented. The approach uses a graph to represent multivariate categorical data and draws the graph in such a way that we can identify patterns, trends and relationship within the data. A mathematical model for the graph layout problem is deduced and a spectral graph drawing algorithm for visualizing multivariate categorical data is proposed. The experiments show that the drawings by the algorithm well capture the structures of multivariate categorical data and the computing speed is fast. 展开更多
关键词 multivariate categorical data GRAPH graph drawing ALGORITHMS
下载PDF
Clustering Categorical Data Based on Within-Cluster Relative Mean Difference
8
作者 Jinxia Su Chunjing Su 《Open Journal of Statistics》 2017年第2期173-181,共9页
The clustering on categorical variables has received intensive attention. In dataset with categorical features, some features show the superior performance on clustering procedure. In this paper, we propose a simple m... The clustering on categorical variables has received intensive attention. In dataset with categorical features, some features show the superior performance on clustering procedure. In this paper, we propose a simple method to find such distinctive features by comparing pooled within-cluster mean relative difference and then partition the data upon such features and give subspace of the subgroups. The applications on zoo data and soybean data illustrate the performance of the proposed method. 展开更多
关键词 CLUSTERING categorical Variable Distinctive Attribute Pooled Within-Cluster Mean RELATIVE DIFFERENCE Hamming Distance
下载PDF
Analysis of Extension Categorical Data Mining Process for the Extension Interior Designing
9
作者 Hui Ma Guangtian Zou 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2016年第6期26-31,共6页
On the basis of extension architectonics,this paper researches the process of extension categorical data mining for extension interior design. In accordance with the theory of extension data mining,the extension categ... On the basis of extension architectonics,this paper researches the process of extension categorical data mining for extension interior design. In accordance with the theory of extension data mining,the extension categorical data mining for the extension interior design can be divided into data preparation,the operation of mining and knowledge application. The paper expatiates the main content and cohesive relations of each link,and emphatically discusses extension acquisition,analysis extension,categorical mining extension,knowledge application extension and other several core nodes that are related with data. Through the knowledge fusion of extension architectonics and data mining,the paper discusses the process of knowledge requirements with multiple classification under different mining targets. The purpose of this paper is to explore a whole categorical data mining process of interior design from extension design data to the design of knowledge discovery and extension application. 展开更多
关键词 extension categorical data mining extension sets extension interior design
下载PDF
Dimensional(premenstrual symptoms screening tool)vs categorical(mini diagnostic interview,module U)for assessment of premenstrual disorders
10
作者 Rifka Chamali Rana Emam +1 位作者 Ziyad R Mahfoud Hassen Al-Amin 《World Journal of Psychiatry》 SCIE 2022年第4期603-614,共12页
BACKGROUND Premenstrual syndrome(PMS)is the constellation of physical and psychological symptoms before menstruation.Premenstrual dysphoric disorder(PMDD)is a severe form of PMS with more depressive and anxiety sympto... BACKGROUND Premenstrual syndrome(PMS)is the constellation of physical and psychological symptoms before menstruation.Premenstrual dysphoric disorder(PMDD)is a severe form of PMS with more depressive and anxiety symptoms.The Mini international neuropsychiatric interview,module U(MINI-U),assesses the diagnostic criteria for probable PMDD.The Premenstrual Symptoms screening tool(PSST)measures the severity of these symptoms.AIM To compare the PSST ordinal scores with the corresponding dichotomous MINI-U answers.METHODS Arab women(n=194)residing in Doha,Qatar,received the MINI-U and PSST.Receiver Operating Characteristics(ROC)analyses provided the cut-off scores on the PSST using MINI-U as a gold standard.RESULTS All PSST ratings were higher in participants with positive responses on MINI-U.In addition,ROC analyses showed that all areas under the curves were significant with the cutoff scores on PSST.CONCLUSION This study confirms that the severity measures from PSST can recognize patients with moderate/severe PMS and PMDD who would benefit from immediate treatment. 展开更多
关键词 Premenstrual symptoms screening tool Premenstrual dysphoric disorder ARABS categorical vs dimensional classification
下载PDF
Combined Use of k-Mer Numerical Features and Position-Specific Categorical Features in Fixed-Length DNA Sequence Classification
11
作者 Dau Phan Ngoc Giang Nguyen +6 位作者 Favorisen Rosyking Lumbanraja Mohammad Reza Faisal Bahriddin Abapihi Bedy Purnama Mera Kartika Delimayanti Mamoru Kubo Kenji Satou 《Journal of Biomedical Science and Engineering》 2017年第8期390-401,共12页
To classify DNA sequences, k-mer frequency is widely used since it can convert variable-length sequences into fixed-length and numerical feature vectors. However, in case of fixed-length DNA sequence classification, s... To classify DNA sequences, k-mer frequency is widely used since it can convert variable-length sequences into fixed-length and numerical feature vectors. However, in case of fixed-length DNA sequence classification, subsequences starting at a specific position of the given sequence can also be used as categorical features. Through the performance evaluation on six datasets of fixed-length DNA sequences, our algorithm based on the above idea achieved comparable or better performance than other state-of-the art algorithms. 展开更多
关键词 Sequence CLASSIFICATION NUMERICAL and categorical FEATURES Feature Selection
下载PDF
On Edge Irregular Reflexive Labeling of Categorical Product of Two Paths
12
作者 Muhammad Javed Azhar Khan Muhammad Ibrahim Ali Ahmad 《Computer Systems Science & Engineering》 SCIE EI 2021年第3期485-492,共8页
Among the huge diversity of ideas that show up while studying graph theory,one that has obtained a lot of popularity is the concept of labelings of graphs.Graph labelings give valuable mathematical models for a wide s... Among the huge diversity of ideas that show up while studying graph theory,one that has obtained a lot of popularity is the concept of labelings of graphs.Graph labelings give valuable mathematical models for a wide scope of applications in high technologies(cryptography,astronomy,data security,various coding theory problems,communication networks,etc.).A labeling or a valuation of a graph is any mapping that sends a certain set of graph elements to a certain set of numbers subject to certain conditions.Graph labeling is a mapping of elements of the graph,i.e.,vertex and for edges to a set of numbers(usually positive integers),called labels.If the domain is the vertex-set or the edge-set,the labelings are called vertex labelings or edge labelings respectively.Similarly,if the domain is V(G)[E(G)],then the labeling is called total labeling.A reflexive edge irregular k-labeling of graph introduced by Tanna et al.:A total labeling of graph such that for any two different edges ab and a'b'of the graph their weights has wt_(x)(ab)=x(a)+x(ab)+x(b) and wt_(x)(a'b')=x(a')+x(a'b')+x(b') are distinct.The smallest value of k for which such labeling exist is called the reflexive edge strength of the graph and is denoted by res(G).In this paper we have found the exact value of the reflexive edge irregularity strength of the categorical product of two paths (P_(a)×P_(b))for any choice of a≥3 and b≥3. 展开更多
关键词 Edge irregular reflexive labeling reflexive edge strength categorical product of two paths
下载PDF
On the Matrices of Pairwise Frequencies of Categorical Attributes for Objects Classification
13
作者 Vladimir N. Shats 《Journal of Intelligent Learning Systems and Applications》 2019年第4期65-75,共11页
This paper proposes two new algorithms for classifying objects with categorical attributes. These algorithms are derived from the assumption that the attributes of different object classes have different probability d... This paper proposes two new algorithms for classifying objects with categorical attributes. These algorithms are derived from the assumption that the attributes of different object classes have different probability distributions. One algorithm classifies objects based on the distribution of the attribute frequencies, and the other classifies objects based on the distribution of the pairwise attribute frequencies described using a matrix of pairwise frequencies. Both algorithms are based on the method of invariants, which offers the simplest dependencies for estimating the probabilities of objects in each class by an average frequency of their attributes. The estimated object class corresponds to the maximum probability. This method reflects the sensory process models of animals and is aimed at recognizing an object class by searching for a prototype in information accumulated in the brain. Because these matrices may be sparse, the solution cannot be determined for some objects. For these objects, an analog of the k-nearest neighbors method is provided in which for each attribute value, the class to which the majority of the k-nearest objects in the training sample belong is determined, and the most likely class value is calculated. The efficiencies of these two algorithms were confirmed on five databases. 展开更多
关键词 categorical Attributes Classification ALGORITHMS INVARIANTS of Matrix DATA DATA Processing
下载PDF
某三级甲等专科医院推进互联网分级诊疗的思考 被引量:2
14
作者 谢诗蓉 叶卿云 +4 位作者 王晨颖 陈琼洲 史晓诞 丁晓璟 叶正强 《中国卫生资源》 CSCD 北大核心 2023年第4期393-396,403,共5页
复旦大学附属眼耳鼻喉科医院在发展互联网医院的基础上,积极探索建立互联网分级诊疗。与全国6省(市)11个地区的16家医院达成互联网分级诊疗战略合作,利用专科医院优势,通过远程会诊、远程教学、科普直播等多项举措共同推进互联网分级诊... 复旦大学附属眼耳鼻喉科医院在发展互联网医院的基础上,积极探索建立互联网分级诊疗。与全国6省(市)11个地区的16家医院达成互联网分级诊疗战略合作,利用专科医院优势,通过远程会诊、远程教学、科普直播等多项举措共同推进互联网分级诊疗。在推进互联网分级诊疗的过程中遇到了远程需求较少、药品配送困难、精准转诊率低、费用结算不明、利益分配不均等难点。应积极尝试从加强医疗联合体合作关系、增加医疗联合体药品目录、加强基层培训、完善医疗联合体利益分配和构建当地眼健康档案等方面进一步完善互联网分级诊疗。 展开更多
关键词 专科医院specialist hospital 互联网医院internet hospital 分级诊疗categorized treatment
下载PDF
Validating Intrinsic Factors Informing E-Commerce: Categorical Data Analysis Demo
15
作者 Anthony Joe Turkson John Awuah Addor Douglas Yenwon Kharib 《Open Journal of Statistics》 2021年第5期737-758,共22页
Statistics is a powerful tool for data measurement. Statistical techniques properly planned and executed give meaning to meaningless data. The difficulty some practitioners encounter hinges on the fact that though the... Statistics is a powerful tool for data measurement. Statistical techniques properly planned and executed give meaning to meaningless data. The difficulty some practitioners encounter hinges on the fact that though there are numerous statistical methods available for use in analysis, the extent of their understanding and ease of using these tools for analysis is limited. This study has twofold purpose: firstly, literature on categorical data commonly used in research w</span><span style="font-family:Verdana;">as</span><span style="font-family:Verdana;"> reviewed</span><span style="font-family:Verdana;">;</span><span style="font-family:""><span style="font-family:Verdana;"> next, we reported the results of a survey we designed and executed. Categorical data was collected via questionnaire and analyzed to serve as a backbone of the robustness of categorical data. Several conjec</span><span style="font-family:Verdana;">tures about the independence of the socio-economic variables and e-commence</span><span style="font-family:Verdana;"> were tested. Some of the factors influencing patronage of e-commerce were </span><span style="font-family:Verdana;">identified. It is clear from the literature that as one’s academic qualification</span><span style="font-family:Verdana;"> improves</span></span><span style="font-family:Verdana;">, </span><span style="font-family:""><span style="font-family:Verdana;">there is an associated improvement in their preference for e-commerce, but the results revealed otherwise. Size of family was found to influence e-commerce. Both income and social status positively affected pa</span><span style="font-family:Verdana;">tronage in e-commerce. Gender also appeared to affect patronage in e-commerce</span><span style="font-family:Verdana;">. 62.3% of staff had patronized e-commerce</span></span><span style="font-family:Verdana;">.</span><span style="font-family:Verdana;"> This shows that e-commerce patronage was gradually increasing. It is therefore our considered view that policy documents regulating and monitoring the use of e-commerce be developed to increase e-commerce participation across the globe</span><span style="font-family:Verdana;">. </span><span style="font-family:Verdana;">It is also recommended that the bottlenecks which obstruct patronage in e-commence be addressed so that a lot more staff will develop a positive attitude towards e-commerce. 展开更多
关键词 categorical Data CHI-SQUARE E-COMMERCE Ordinal Data Nominal Data
下载PDF
The Grammatical Categorization of Mandarin在/zài:Spatiality,Temporality and Semantic Construal
16
作者 LIU Xing 《Journal of Literature and Art Studies》 2023年第4期295-303,共9页
Mandarin在(pinyin:zài)is the most frequently used character in representing spatial and temporal relationship.Current studies mostly focus on its lexical meaning and syntactic structure while cognitive features o... Mandarin在(pinyin:zài)is the most frequently used character in representing spatial and temporal relationship.Current studies mostly focus on its lexical meaning and syntactic structure while cognitive features of its grammatical categories have been neglected.This paper investigates into the categorization of zài by conducting a morphosyntactic test among College English majors in China.The results show that:prototypes are organizing the grammatical categories of zài at all levels in terms of intra-categorial gradience;the semantic construal of zài construction could significantly influence the accuracy of the grammatical categorization of zài;the syntactic structure can provide viable cue for the identification of grammatical categories of zài;spatiality,temporality and the status of existing are three essential semantic features encoded by zài,the concurrence of which leads to various degree of inter-categorial vagueness,indicating a conflict between the rigid grammatical classification and the indeterminate nature of the grammatical functions of zai,suggesting the necessity to reconsider the efficacy of applying indiscriminately the Anglo-Saxon grammar into the study of Chinese spatial-temporal constructions. 展开更多
关键词 Mandarin zài grammatical categorization TEMPORALITY SPATIALITY semantic construal
下载PDF
A Study on Second Language Vocabulary Acquisition Under the Categorization Theory
17
作者 Ting Xiao 《Journal of Contemporary Educational Research》 2023年第12期142-150,共9页
In cognitive linguistics,debates on the status and functions of categorization have been a heated issue.In semantics and second language acquisition,scholars have discussed and achieved vocabulary acquisition from dif... In cognitive linguistics,debates on the status and functions of categorization have been a heated issue.In semantics and second language acquisition,scholars have discussed and achieved vocabulary acquisition from different perspectives and academic levels.Vocabulary learning exerts a fundamental role in second language vocabulary acquisition(SLVA),and it is closely related to learners’cognitive competence.However,studies on second language vocabulary acquisition under the categorization theory in cognitive linguistics have received less attention from linguists when compared with other studies.This paper employs two representative dimensions,the basic-level effect and the prototype effect,under the categorization theory to further delve into the implications on second language vocabulary acquisition.This article first provides a comprehensive introduction to the nature and the approaches of the categorization theory,and then analyzes the relations and implications for second language vocabulary acquisition under the categorization theory from the perspective of the basic-level and the prototype effects.The research results showed that the basic-level effect on SLVA is mainly on the classification of word categories distinguished from the superordinate and subordinate categories,while the prototype effect is more on understanding the complexity and use of word meaning. 展开更多
关键词 CATEGORIZATION Second language vocabulary acquisition Basic-level PROTOTYPE
下载PDF
认知过程的翻译理论研究 被引量:6
18
作者 李平 《上海翻译》 CSSCI 北大核心 1999年第4期14-15,共2页
The recent developments of cognitive theories may provide a better interpretation for studies of translation rather than a description.The paper tries to put categorization and metaphor into the process of translating... The recent developments of cognitive theories may provide a better interpretation for studies of translation rather than a description.The paper tries to put categorization and metaphor into the process of translating and translators’ psychology so as to produce a more powerful interpretation. [ 展开更多
关键词 COGNITION INTERPRETATION CATEGORIZATION METAPHOR
下载PDF
A New Approach of Feature Selection for Text Categorization 被引量:6
19
作者 CUI Zifeng XU Baowen +1 位作者 ZHANG Weifeng XU Junling 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1335-1339,共5页
This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of e... This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of each other, widely used in the probabilistic models for text categorization (TC), is discussed. However, the basic hypothesis is incom plete for independence of feature set. From the view of feature selection, a new independent measure between features is designed, by which a feature selection algorithm is given to ob rain a feature subset. The selected subset is high in relevance with category and strong in independence between features, satisfies the basic hypothesis at maximum degree. Compared with other traditional feature selection method in TC (which is only taken into the relevance account), the performance of feature subset selected by our method is prior to others with experiments on the benchmark dataset of 20 Newsgroups. 展开更多
关键词 feature selection independency CHI square test text categorization
下载PDF
Comparison of Text Categorization Algorithms 被引量:4
20
作者 SHIYong-feng ZHAOYan-ping 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期798-804,共7页
This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classi... This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classifying algorithms in different fields. Finally some evaluations and summaries of these algorithms are discussed, and directions to further research have been pointed out. Key words text categorization - naive bayes - KNN - SVM - neural network CLC number TP 391 Foundation item: Supported by the National Natural Science Foundation of China (70031010) and the Research Foundation of Beijing Institute of TechnologyBiography: SHI Yong-feng (1980-), male, Master candidate, research direction: web information mining. 展开更多
关键词 text categorization naive bayes KNN SVM neural network
下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部