New guidance for using t-SNE: Alternative defaults, hyperparameter selection automation, and comparative evaluation


Abstract: We present new guidelines for choosing hyperparameters for t-SNE and an evaluation comparing these guidelines to current ones. These guidelines include a proposed empirically optimum guideline derived from a t-SNE hyperparameter grid search over a large collection of data sets. We also introduce a new method to featurize data sets using graph-based metrics called scagnostics; we use these features to train a neural network that predicts optimal t-SNE hyperparameters for the respective data set. This neural network has the potential to simplify the use of t-SNE by removing guesswork about which hyperparameters will produce the best embedding. We evaluate and compare our neural network-derived and empirically optimum hyperparameters to several other t-SNE hyperparameter guidelines from the literature on 68 data sets. The hyperparameters predicted by our neural network yield embeddings with accuracy similar to that of the best current t-SNE guidelines. Using our empirically optimum hyperparameters is simpler than following previously published guidelines but yields more accurate embeddings, in some cases by a statistically significant margin. We find that the useful ranges for t-SNE hyperparameters are narrower and include smaller values than previously reported in the literature. Importantly, we also quantify the potential for future improvements in this area: using data from a grid search of t-SNE hyperparameters, we find that an optimal selection method could improve embedding accuracy by up to two percentage points over the methods examined in this paper.

Authors: Robert Gove, Lucas Cadalzo, Nicholas Leiby, Jedediah M. Singer, Alexander Zaitzeff

Affiliation: Two Six Technologies

Source: Visual Informatics (EI-indexed), 2022, Issue 2, pp. 87-97 (11 pages)

Acknowledgments: We thank Kevin Merchant, David Slater, and Reed Gordon-Sarney for their technical support and thoughtful discussion.

Keywords: Dimensionality reduction; Machine learning; t-SNE

Classification code: O15 [Science: Fundamental Mathematics]

