Vai al contenuto principale
Oggetto:

Programming for Data Science

Oggetto:

Programming for Data Science

Oggetto:

Academic year 2023/2024

Course ID
ONC0254
Teacher
Marco Beccuti (Lecturer)
Year
1st year,
Teaching period
First semester
Type
Basic
Credits/Recognition
6
Course disciplinary sector (SSD)
INF/01 - informatics
Delivery
Formal authority
Language
English
Attendance
Obligatory
Type of examination
Written and oral (optional)
Oggetto:

Sommario del corso

Oggetto:

Course objectives

Since this course is divided into two modules, we reported hereafter the objective for each module

1) R Programming for data science

The module aims to introduce methods, techniques, and related computer science instruments for the analysis of experimental data.

2) Python programming for data science

The module aims to introduce methods, techniques, and related computer science instruments for the analysis of experimental data using Python language.

Oggetto:

Results of learning outcomes

KNOWLEDGE AND UNDERSTANDING – Completing the course students will be able to:

1) use suitable descriptive and inferential statistical techniques in R and Python to describe and understand the phenomena being studied;

2) manage suitable R and Python instruments for statistical data analysis.

APPLYING KNOWLEDGE AND UNDERSTANDING – Students will perform the statistical analyses required by the problem under study by developing R and Python programs

MAKING JUDGEMENTS – Students will decide which statistical techniques in R and Python to use according to the available data sets to describe and understand the phenomena under consideration.

COMMUNICATION – The student will be able to justify the choices for the analysis to be performed and to give a synthetic description of the techniques employed and of the results obtained.

Oggetto:

Program

Since this course is divided into two modules, we reported hereafter the program for each module.

1) R Programming for data science

  • Introduction to Data science;

  • Visualization using ggplot2;

  • Basic R functionalities:

    • Data structures: vector, matrix, list and data frame, tibble;

    • Apply operators;

    • Input/output operator;

    • Package and library.

  • Tidy data in R

  • Data Transformation;

  • Programming with R:

    • Function;

    • Flow control: if,for, while, break ... statements;

    • Debugging in R.

  • Creation of package in R

 

2) Python programming for data science

  • Introduction to Python and associated development environments

  • Numpy basics: array and vectorized computation

  • Getting started with Pandas

  • Data loading and storage in Python

  • Data wrangling: clean, transform, merge, and reshape

  • Data plotting and visualization

 

Oggetto:

Course delivery

The course consists of  20 hours of lectures and 28 hours of laboratories. Laboratories include exclusively practical activities.

The slides presented during lectures are available to students as online materials.

Attendance of lessons is not mandatory, but highly recommended due to the necessity of learning and employing specific computer science instruments.

Oggetto:

Learning assessment methods

The exam consists of an oral test  and requires a practice exercise on R and Python  programming languages

Suggested readings and bibliography

Oggetto:

1) R Programming for data science

- Garrett Grolemund and Hadley Wickham, R for Data Science, O'Reilly Media, Inc, USA, 2017.

- P. Dalgaard, Introductory Statistics with R, Springer 2008

- The R Manuals: An Introduction to R (http://cran.r-project.org/doc/manuals/r-releas /Rintro.pdf)

The teaching material used for lessons and a series of practical exercises are available on the website of the course.

2) Python programming for data science

Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter 3rd Edition
by Wes McKinney

The teaching material used for lessons and a series of practical exercises are available on the website of the course.

 

 



Oggetto:
Last update: 15/03/2024 15:11
Location: https://www.aibh.unito.it/robots.html
Non cliccare qui!