ASExplorer:基于联合熵的多维相关性可视分析系统ASExplorer:Multi-dimensional Correlation Visual Analysis System Based on Joint Entropy
张迪;杨沛;邓鑫波;赵千川;
摘要(Abstract):
数据维度相关性分析一直是数据分析领域的研究重点。传统的可视化方法可通过图形描述直观判断几个数据维度存在何种相关关系,但是难以解决维数灾难问题。一些数据挖掘方法虽然可行,但是难以把过程具象化,并且在一些应用场景下仍然需要可视化方法提供参数指导。提出了ASExplorer:一个探索高维数据维度相关性为目的的可视分析系统。该系统首先基于联合熵的维度重要性评价算法,帮助用户选择分析路径和过滤数据,然后基于以采样尺度为中心的交互探索方法,令用户可以同时探索多个数据维度在采样尺度变化时的关联关系。该系统适用于缺乏先验知识的数据集的早期分析过程,案例分析和用户研究验证了该系统的有效性。
关键词(KeyWords): 高维数据;联合熵;数据可视化;可视分析
基金项目(Foundation):
作者(Author): 张迪;杨沛;邓鑫波;赵千川;
Email:
DOI:
参考文献(References):
- [1] PHONG N,CAGATAY T,GENNADY A.Understanding user behaviour through action sequences:From the usual to the unusual[J].IEEE Transactions on Visualization and Computer Graphics,2019,25(9):2838-2852.
- [2] GUNDOGDU E,ALATAN A A.Good features to correlate for visual tracking[J].IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society,2018,27(5):2526-2540.
- [3] KEIM D A,KRIEGEL H P,SEIDL T.Visual feedback in querying large databases[C]//Proceedings of IEEE Conference on Visualization,1993.
- [4] KEOGH E,MUEEN A.Curse of dimensionality[J].Ind Eng Chem,2009,29(1):48-53.
- [5] HU Q,YU D,XIE Z.Information-preserving hybrid data reduction based on fuzzy-rough techniques[J].Pattern Recognition Letters,2006,27(5):414-423.
- [6] THEUS M.High-dimensional data visualization[M].Berlin Heidelberg:Springer-Verlag,2008.
- [7] TAO J,XU J,WANG C,et al.HoNVis:Visualizing and exploring higher-order networks[C]//Proceedings of IEEE Pacific Visualization Symposium,2017.
- [8] KAIRAM S,MACLEAN D,SAVVA M,et al.GraphPrism:Compact visualization of network structure[C]//Proceedings of International Conference on Advanced Visual Interfaces,2012:498-505.
- [9] CUNNINGHAM J P,GHAHRAMANI Z.Linear dimensionality reduction:Survey,insights,and generalizations[J].arXiv:1406.0873,2014.
- [10] BAIN A,DAN C.Fundamentals of stochastic filtering[M].New York:Springer,2009.
- [11] PRAMOKCHON P,PIAMSA-NGA P.Effective threshold estimation for filter-based feature selection[C]//Proceedings of International Computer Science and Engineering Conference(ICSEC),2016.
- [12] BETTINI C,JAJODIA S,WANG S.Time granularities in databases,data mining,and temporal reasoning[M].Berlin Heidelberg:Springer-Verlag,2000.
- [13] LIU Z,JIANG B,HEER J.imMens:Real-time visual querying of big data[J].Computer Graphics Forum,2013,32:421-430.
- [14] KEIM D A,KRIEGEL H P.Visualization techniques for mining large databases:A comparison[J].IEEE Transactions on Knowledge and Data Engineering,1996,8(6):923-938.
- [15] SHAO L,SILVA N,EGGELING E,et al.Visual exploration of large scatter plot matrices by pattern recommendation based on eye tracking[C]//Proceedings of 2017 ACM Workshop,2017.
- [16] INSELBERG A.Parallel coordinates[M].New York:Springer,2009.
- [17] RONALD M,PICKETT G G H L.Harnessing preattentive perceptual processes in visualization[J].Berlin Heidelberg:Springer-Verlag,1995:33-45.
- [18] MORRIS C J,EBERT D S.An experimental analysis of the effectiveness of features in chernoff faces[C]//Proceedings of SPIE,2000:12-17.
- [19] KEIM A D.Designing pixel-oriented visualization techniques:Theory and applications[J].IEEE Transactions on Visualization and Computer Graphics,2000,6(1):78.
- [20] TUFTE E R.The visual display of quantitative information[M].2nd Ed.[S.l.]:Graphics Pr,2001.
- [21] WARD M O,GRINSTEIN G G,KEIM D A.Interactive data visualization-foundations,techniques,and applications[M].[S.l.]:A K Peters,2010.
- [22] BUTKIEWICZ W D Z W.Multi-focused geospatial analysis using probes[J].IEEE Transactions on Visualization and Computer Graphics,2008,14(6):1165-1172.
- [23] FERREIRA N,POCO J,VO H T,et al.Visual exploration of big spatio-temporal urban data:A study of New York City Taxi Trips[J].IEEE Transactions on Visualization&Computer Graphics,2013,19(12):2149-2158.
- [24] TURKAY C,SLINGSBY A,HAUSER H,et al.Attribute signatures:Dynamic visual summaries for analyzing multivariate geographical data[J].IEEE Trans on Vis Comput Graph,2014,20(12):2033-2042.
- [25] SMILDEAB B R.Principal component analysis[J].Analytical Methods,2014,6(9):2812-2831.
- [26] POTTER K,ROSEN P,JOHNSON C R.From quantification to visualization:A taxonomy of uncertainty visualization approaches[J].IFIP Advances in Information&Communication Technology,2012,377:226.
- [27] HAZARIKA S,BISWAS A,DUTTA S,et al.Information guided exploration of scalar values and isocontours in ensemble datasets[J].Entropy,2018,20(7):540.
- [28] HOLLIMAN N S,COLTEKIN A,FERNSTAD S J,et al.Visual Entropy and the Visualization of Uncertainty[J].arXiv:1907.12879,2019.
- [29] LU K,SHEN H.A compact multivariate histogram representation for query-driven visualizaion[C]//Proceedings of IEEE Symposium on Large Data Analysis&Visualization,2015:49-56.
- [30] HAZARIKA S,DUTTA S,SHEN H W.Visualizing the variations of ensemble of isosurfaces[C]//Proceedings of IEEE Pacific Visualization Symposium(PacificVis),2016:209-213.
- [31] B?ASZCZY?SKI J,STEFANOWSKI J.Local data characteristics in learning classifiers from imbalanced data[C]//Advances in Data Analysis with Computational Intelligence Methods,2018.
- [32] HECHT B,MOXLEY E.Terabytes of Tobler:Evaluating the first law in a massive,domain-neutral representation of world knowledge[C]//Proceedings of the 9th International Conference on Spatial Information Theory,2009.
- [33]孙国道,梁荣华,何贤国,等.高维时空房地产数据的可视分析[J].计算机辅助设计与图形学学报,2013,25(8):1169-1176.
- [34] BETTINI C,JAJODIA S,WANG S.Time granularities in databases,data mining,and temporal reasoning[M].[S.l.]:Springer Science&Business Media,2013.
- [35] LAM N S,QUATTROCHI D A.On the issues of scale,resolution,and fractal analysis in the mapping sciences*[J].Professional Geographer,2010,44(1):88-98.
- [36] KOYTEK P,PERIN C,VERMEULEN J,et al.MyBrush:Brushing and linking with personal agency[J].IEEE Transactions on Visualization&Computer Graphics,2018,24(1):605-615.
- [37] DYKES J,BRUNSDON C.Geographically weighted visualization:Interactive graphics for scale-varying exploratory analysis[J].IEEE Transactions on Visualization&Computer Graphics,2007,13(6):1161.
- [38] DUANY A.Introduction to the special issue:The Transect[J].Journal of Urban Design,2002,7(3):251-260.
- [39]杨文君.属性约简方法在入侵检测技术中的应用研究[D].哈尔滨:哈尔滨工程大学,2009.