Page 1 of 3

Prague Data School

Prague Data School is an intensive 5-day course focused on working with data, statistics and artificial intelligence using the Python language. Participants will learn to clean, visualise and analyse real-world data in an interactive workshop environment. The course is perfect for anyone transitioning from Excel to Python.

↓UPCOMING DATA SCHOOL IN PRAGUE↓

https://summerdataschool.codebridgecollege.com/ REGISTRATION IS OPEN

Location: Prague, Pikrtova 1737/1a Duration: 5 full-time days (45 hours), 8:30 AM - 5:30 PM

Prerequisites: basic knowledge of Excel. Programming knowledge is not required.

The bootcamp is conducted in Czech/Slovak language.

Students are eligible to apply for a Scholarship.

Who is the course best for?

Analysts who want to transition from Excel to larger datasets, learn to model data and create visualisations.

PhD students and Researchers who work with data in their research and need to use statistics and regression models.

Professionals from various fields who want to gain practical knowledge in the area of data science, AI and statistics.

Students keen to learn the basics of data processing and AI.

Course Structure

Day 1: Introduction to Python and Data Processing

Fundamentals of programming in Python Working with data in Pandas

Data cleaning, filtering and transforming

Manipulating dates and strings

Groupby and aggregations, multi-index

Long and wide data formats

Data import and export, linking with Excel Day 2: Data Visualisation

Fundamentals of data visualisation in Python

Matplotlib, Seaborn, and Plotly libraries

Groupby + aggregations and their visualisation

Interactive charts and dashboard elements

Geographical and multivariate charts

Multivariate plots

Principles of data communication Day 3: Statistics and Regression Modelling

Fundamentals of statistics and probability

Statistical hypothesis testing

Interpretation of stat results and p-values

Linear and logistic regression

Metrics of predictive power

Correlation, causality and randomisation

Natural experiments

Statistics vs. machine learning Day 4: Machine Learning

AI from the ground up: concepts, types and uses

Training machine learning models

Classification and regression models

Sensitivity, specificity, ROC curve

Interpretation of ML model decision-making

Neural networks and deep learning

Unsupervised learning, t-SNE

Integration of LLMs into Python projects

Extraction of structured data from text

Day 5: Data Hackathon! In cooperation with our partners, we have prepared challenging data tasks using data from the healthcare and education sectors. The goal of the hackathon is for every participant to utilise their new data and programming skills directly in practice and at the same time, learn something new about important social topics. Several teams already came up with interesting findings in both tasks, which provided new insights for stakeholders.

Why Python? Python is considered the most suitable language for data science and machine learning today. It is clear, intuitive, and ideal even for beginners. Already on the first day, you will learn its fundamentals and see how to solve real data tasks with just a few lines of code.

Why Codebridge College? We understand that learning data science skills isn’t easy. It requires a new way of thinking, patience and the right support. At Codebridge College, we know how overwhelming it can feel to switch from Excel to Python, or from raw tables to statistical models and AI. That’s why our bootcamps are designed to be practical, clear and guided by mentors who walk you through every step. Discussion is encouraged and no question is left unanswered.

Participants consistently describe our courses as the most practical training they’ve attended and with new skills they can apply immediately. -> Feedback on our bootcamps

Graduate Profile

After completing the Prague Data School you will be able to:

Work with real-world data

Clean, transform and explore datasets using Python

Move beyond Excel and work efficiently with larger, messy data

Use pandas to filter, join, reshape and summarise information

Document your work in clear, reproducible notebooks

Create high quality visualisations

Build clear charts using Matplotlib, Seaborn and Plotly

Choose the right type of plot for the story you want to show

Design clean, readable figures for presentations and reports

Communicate insights with confidence

Apply statistical models in practice

Run and interpret common statistical tests (parametric and non-parametric)

Understand distributions, variance and sampling

Fit regression models and understand their output

Explain results in a way that makes sense for non-experts

Build and evaluate machine-learning models

Prepare data for ML (splits, encoding, scaling, feature selection)

Train, evaluate and explain models

Understand accuracy, precision, recall, ROC curves and overfitting

Choose the right model for classification or regression tasks

Use AI and LLM tools in data workflows

Integrate AI tools into your daily work to speed up coding, cleaning and documentation

Use LLMs to explain code, detect errors and improve clarity

Extract information and structured data from unstructured text documents Integrate LLM into your python project pipeline

Lecturers

Imrich Berta A graduate of Applied Mathematics at the University of Cambridge, with experience in machine learning models for disease prediction and clinical data analysis. He currently works as a consultant for government institutions and start-ups. He actively mentors analysts and organises programming and data workshops for students. Imrich enjoys helping people who do not consider themselves "math types" understand and apply mathematical and statistical principles in practice.

Laura Johanesova A bioinformatician studying at the University of Vienna. Skills in shell scripting, R and Python are crucial for her research in regeneration. She designs and leads intensive, practically oriented training sessions that help analysts transition from Excel to Python or R and applied machine learning. Laura's work includes the creation of educational programmes, the development of data skills in science and various projects in the field of healthcare and biomedical data.

Jan Dudek Magna cum laude economics and econometrics graduate from Rice and Oxford. Improved the risk-equalization model at the Ministry of Health and implemented ML fraud detection algorithms in Slovak healthcare. Currently a senior data scientist specializing in the health insurance industry.

Price

Private sector 28 000 CZK | Public sector 24 000 CZK | Academia 20 000 CZK

Prices are without VAT. 20 scholarships are available for students.

Contact

Laura Johanesova laura.johanesova@codebridgecollege.com