This volume comprises papers dedicated to data science and the extraction of knowledge from many types of data: structural, quantitative, or statistical approaches for the analysis of data; advances in classification, clustering and pattern recognition methods; strategies for modeling complex data and mining large data sets; applications of advanced methods in specific domains of practice. The contributions offer interesting applications to various disciplines such as psychology, biology, medical and health sciences; economics, marketing, banking and finance; engineering; geography and geology; archeology, sociology, educational sciences, linguistics and musicology; library science. The book contains the selected and peer-reviewed papers presented during the European Conference on Data Analysis (ECDA 2013) which was jointly held by the German Classification Society (Gf Kl) and the French-speaking Classification Society (SFC) in July 2013 at the University of Luxembourg.
Daftar Isi
Part I Invited Papers: Modernising Official Statistics – A Complex Challenge.- A New Supervised Classification of Credit Approval Data via the Hybridized RBF Neural Network Model Using Information Complexity.- Finding the Number of Disparate Clusters with Background Contamination.- Clustering of Solar Irradiance.- Part II Data Science and Clustering: Factor Analysis of Local Formalism.- Recent progress in Complex Network Analysis – Models.- Recent Progress in Complex Network Analysis – Results.- Similarity Measures of Concept Lattices.- Flow-Based Dissimilarities: Shortest Path, Commute Time, Max-Flow and Free Energy.- Resampling Techniques in Cluster Analysis – Is Subsampling Better Than Bootstrapping?.- On-Line Clustering of Functional Boxplots for Monitoring Multiple Streaming Time Series.- Smooth Tests of Fit for Gaussian Mixtures.- Part III Machine Learning and Knowledge Discovery: P2P RVM for Distributed Classification.- Selecting a Multi-Label Classification Method for an Interactive System.- Visual Analysis of Topics in Twitter Based on Co-Evolution of Terms.- Incremental Weighted Naive Bayes Classifiers for Data Stream.- SVM Ensembles are Better When Different Kernel Types are Combined.- Part IV Data Analysis in Marketing: Ratings-Based Versus Choice-Based Conjoint Analysis for Predicting Choices.- A Statistical Software Package for Image Data Analysis in Marketing.- The Bass Model as Integrative Diffusion Model – A Comparison of Parameter Influences.- Preference Measurement in Complex Product Development – A Comparison of Two-Staged SEM Approaches.- Combination of Distances and Image Features for Clustering Image Data Bases.- A Game Theoretic Product Design Approach Considering Stochastic Partworth Functions.- Key Success-Determinants of Crowdfunded Projects – An Exploratory Analysis.- Preferences Interdependence among Family Members – Case III/APIM Approach.- Part V Data Analysis in Biostatistics and Bioinformatics: Evaluation for Cell Line Suitability for Disease Specific Perturbation Experiments.- Effect of Hundreds Sequenced Genomes on the Classification of Human Papilloma Viruses.- Donor Limited Hot Deck Imputation – A Constrained Optimization Problem.- Classification and Data Set Analysis Using Ensembles of Representative Prototype Sets.- Event Prediction in Pharyngeal High-Resolution Manometry.- Edge Selection in a Noisy Graph by Concept Analysis – Application to a Genomic Network.- Part VI Data Analysis in Education and Psychology: Linear Modelling of Differences in Teacher Judgment Formation of School Tracking Recommendations.- Psychometric Challenges in Modeling Scientific Problem-Solving Competency – An Item Response Theory Approach.- The Luxembourg Teacher Databank 1845-1939. Academic Research into the Social History of the Luxembourg Primary School Teaching Staff.- Part VII Data Analysis in Musicology: Correspondence Analysis, Cross-Autocorrelation and Clustering in Polyphonic Music.- Impact of Frame Size and Instrumentation on Chroma-Based Automatic Chord Recognition.- Interpretable Music Categorisation Based on Fuzzy Rules and High-Level Audio Features.- Part VIII Data Analysis in Communication and Technology: What is in a Like? Preference Aggregation on the Social Web.- Predicting Micro-Level Behavior in Online Communities for Risk Management.- Human Performance Profiling While Driving a Sidestick-Controlled Car.- Multivariate Landing Page Optimization Using Hierarchical Bayes CBC Analysis.- Hellinger Distance Based Feature Construction with Applications to the FACT Experiment.- Part IX Data Analysis in Administration and Spatial Planning: Hough Transform and Kirchhoff Migration for Supervised GPR Data Analysis.- Application of Hedonic Methods in Modelling Real Estate Prices in Poland.- Smart Growth Path as the Basis for the European Union Countries Typology.- The Influence of Upper Level NUTS on Lower Level Classification of EU Regions.- Part X Data Analysis in Library Science: Multilingual Subject Retrieval – Bibliotheca Alexandrina’s Subject Authority File and Linked Subject Data.- The Vu Find Based ‘MT-Katalog’ – A Customized Music Library Service at the University of Music and Drama Leipzig.