Starting from March 30, every Wednesday, Nikolay Pavlov, Data Scientist @ Azzurro.io, will hold a series of Data Analysis with R workshops, reports the IP-portal DOU.UA.
Data Analysis with R includes eight 2-hour sessions dedicated to Data Science.
Participants of the workshops will be able to learn about functional programming (Scala) and machine learning (R&Spark) absolutely free.
Organizer of training – Kharkiv Computer Science Group and the informal association of R&D companies and individual researchers.
Introduction to data
R programming language
Observations and variables
Relationship between variables
Population and sample
Dependent and independent variables
Experimental design and sampling methods
Data exploration, visualization and cleaning
Data import, cleaning and manipulations
Histogram, mean, variance and standard deviation
Box plots, quartiles, median and outliers
Categorical data, contingency tables and bar plot
Outcome, random process and Law of Large numbers
Disjoint/joint outcomes, addition rule
Conditional, marginal and joint probabilities
Random variables, Expected Value, Variance
Probability distributions: PDF, CDF
Type I, type II errors, power
Paired data, different of two means
Inference for categorical data
Linear regression and least squares (LS)
Conditions for fitting regression line
Residuals analysis, R^2
Interpretation and inference
Machine learning and Supervised learning
Regression / Classification
Gradient descent, SGD, mini-batches
Decision Trees, Random Forest, Neural Networks, SVM
Bias-Variance tradeoff, regularization L1/L2
BigData, R and Apache Spark
Resilient Distributed Datasets (RDD)
SparkR, Data Frame operations
Machine Learning in Spark
Where: Fabrika.space (Blagoveshchenska Street, 1).
When: every Wednesday from 30 March to 18 May
Time: 19:00 to 21:00
Price: free admission
Attention! Be registered and carry a laptop with pre-installed R language and IDE R-Studio.