This course introduces some of most important and popular techniques in data-mining applications with R.
Data mining is the computational process of discovering patterns in large data sets.
During the two-days course we will reviw a wide variety of techniques to catch information from big amount of data: Dimensionality reduction, Clustering, Classification and Prediction examples will be presented and deepened.
The course will start with an introduction to basic methods for data description. After that, we will review the most popular techniques for data/dimensionality reduction, as Multidimensional Scaling, Principal Components Analysis, Correspondence Analysis. Next, we will focus on methods for searching for “natural subgroups” within data, as Hierachical/non hierarchical Cluster Analysis, Gaussian Mixtures Models.
The end of first day and the beginning of second day will present techniques for classification analysis (Linear/Quadratic Discriminant Analysis, Logistic Regression, K-Nearest-Neighborhood,…).
Finally, in remaining part of second day, we will review some techniques for variables selection, collinearity reduction, and best prediction for regression models (PCA regresssion, Ridge Regression, Lasso Regression, Elastic-Net regression, ..)
- Univariate Descriptive Statistics
- Reduction of Data Dimensions (MDS, PCA and EFA, CA)
- Clustering (HC, NHC, GMM)
- Classification (LDA, CLASS, KNN)
- Prediction (Several techniques to model data)
The two-days course costs euro 800 + VAT.
Should I take this course?
This class will be a good fit for you if you are already using R and you want an overview of data-mining techniques with R. Some background in theoretical statistics, probability, linear and logistic regression is required.
What does the cost include?
The cost includes lunch, comprehensive course materials + 1 hour of individual online post course support for each student within 30 days from course date.
There is a students discount?
We offer an academic discount for those engaged in full time studies or research. Please contact us for further information at firstname.lastname@example.org
There is a group discount?
We offer a group discount for people coming from the same company or organization. Please contact us for further information at email@example.com
What should I bring?
A laptop with the latest version of R and R-Studio.
Who will I learn from?
Enrico Pegoraro works in R training and consulting activities, with a special focus on Six Sigma, industrial statistical analysis and corporate training courses. Enrico graduated in Statistics from the University of Padua.
He has taught statistical models and R for hundreds of hours during specialized and applied courses, in universities, masters and companies.
What language is the course taught?
This course is taught in italian. Course material in English language
How can I reach your place?
Legnano is about 30 min by train from Milano. Trains from Milano to Legnano are scheduled every 30 minutes, and Quantide premises are 3 walking minutes from Legnano train station.
What is the minimum number of participants for this course?
A minimum number of 3 participants is required for the course to take place. If the minimum number of participants is not reached, we will refund all registration fees for those who signed up.
How can I contact you if I have further questions?
You can contact us at firstname.lastname@example.org