A beginner’s gateway to data analysis with R
Introduction: Overcoming the Fear of Coding
The mindset shift: From apprehension to confidence
Why R and RStudio?
Course structure and expectations
Session 1: RStudio Infrastructure (July 21, 2025 | 4 hours)
Installing R and RStudio
Navigating the interface: Script, Console, Environment, and Help windows
Customizing layouts and shortcuts
How to access help (?function, R Documentation, online forums)
Core data types: numeric, character, logical, factors
Data structures: vectors, lists, matrices, and data frames
Checking and converting data types
Hands-on exercises: Creating and inspecting structures
Session 2: Data Manipulation (July 22, 2025 | 4 hours)
Subsetting and filtering data
Reshaping data: gather() and spread() (from tidyr)
Handling missing values
Introduction to dplyr verbs (select(), filter(), mutate())
Hands-on exercises: Data Manipulation
Descriptive statistics (mean, median, variance)
Basic visualizations: histograms, boxplots, scatter plots (ggplot2)
Identifying patterns and outliers
Hands-on exercises: Exploring a dataset
Session 3: Deep dive into Data Visualization (July 23, 2025 | 4 hours)
Principles of Effective Visualization
Overview of R’s Visualization Systems (Base R, Lattice, GGPLOT2)
Lattice Package
Basics: Syntax and Formula Interface (~x | g)
Key Plot Types: Scatter, Histogram, Boxplot, Density
Multi-Panel Conditioning (facet_wrap, facet_grid)
Customization: Colors, Axes, and Legends
GGPLOT2 Package
Grammar of Graphics (Layers: Data, Aesthetics, Geoms)
Essential Geoms: geom_point(), geom_line(), geom_bar()
Themes, Scales, and Annotations
Interactive Visuals with plotly/shiny (Optional)
Exporting Plots (PDF/PNG) & Reproducibility
Mini-Projects: Real-World Datasets
Session 4: Statistical Data Analysis (July 24, 2025 | 4 hours)
Hypothesis testing:
Parametric and non-parametric tests
Correlation analysis
Interpreting and presenting results
Hands-on exercises: Test selected hypotheses
Session 5: Linear Regression Analysis (July 25, 2025 | 4 hours)
From correlation to simple linear regression
Multiple Linear Regression
Model Selection and Interpretation
Packages for advanced analysis (e.g., lme4, caret)
R Markdown for reproducible reports
Rstudio interphase
Data Wrangling
Descriptive Statistics
Statistical models
Target Audience: Are you a researcher or a programming enthusiast? Then this course is for you. You need no prior knowledge in statistics or programming. However, knowledge of basic statistics will be an advantage.
Dates:
Mon to Fri : 21-25 July 2025: 9:00h-12:30h
Payment details:
Names: Adiaba Consulting Group
IBAN BE56 3632 3839 2088, BIC: INGBNL2A
(Please add following reference: your_last_name/RSTUDIO/2025)
Paypal: njiabatih@gmail.com
Register using the form below: