EduXchange.nl

Big Data

INF33806

About this course

Big Data usually refers to data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process within a tolerable elapsed time. With the advancements in computing, the realization of Big Data systems has now become feasible and can trigger innovation and growth for various application domains.
This course both discusses the key concepts of Big Data and provides hands-on experience in developing and using Big Data systems. We introduce concepts related to Big Data system architectures, distributed systems, the Map-Reduce framework, and scalable linear and machine learning models, and show how they can be used with cutting-edge software platforms. Students will practice with tools in individual tutorials and gain hands-on experience by working on a group project set up as a "data challenge". In this project, students demonstrate not only the skills they acquired in the course but also their creativity as data scientists, which includes communicating the value of their findings with visualization tools. The course is designed to be accessible to students of all disciplines in the life and social sciences, for example food and health, biosystems engineering, bioinformatics, geo-information science, environmental science and plant science, amongst others.

Learning outcomes

After successful completion of this course students are expected to be able to:

  • Understand the basic concepts related to Big Data and data-driven value-creation in the environmental, social and life sciences.
  • Apply various tools in the big data ecosystem for handling big data.
  • Design a big data application derived from a big data reference architecture through requirement analysis and basic software modelling.
  • Build a big data system for a real-world life science application implementing scalable descriptive and predictive data analytics techniques.
  • Communicate meaningful patterns in data through data visualisation, reporting, presentation and documentation.
  • Determine the value of data-driven innovation, and associate it with their own course of studies.

Prior knowledge

Assumed Knowledge:

Fundamentals of programming (e.g. INF22306 Programming in Python).
Specifically, you should be acquainted with the following concepts and techniques:

  • variables, assignment, expressions, operators;
  • functions (and/or procedures, subroutines, methods) and parameters, including writing your own functions;
  • control structures, at least if, for and while;
  • data structures (lists, tuples, dictionaries);
  • libraries for data manipulation and visualization (NumPy, pandas and Matplotlib).

Familiarity with relational databases (e.g. INF21306 Data Management) is an advantage.

Additional information

Course • 6 ECTS • broadening
  • Level: bachelor
If anything remains unclear, please check the FAQ of Wageningen University.

Starting dates

  • 27 Oct 2025, ends 19 Dec 2025

    Location: Wageningen
    Language: English
    Term: Period 2
    Monday, Tuesday, Thursday and Friday, 09:00 - 13:00
    Register between 1 Jun, 00:00 and 28 Sept
These offerings are valid for students of Utrecht University