Data Science Week Spring 2025

March 3, 2025— March 10, 2025

What is the Data Science Week?

The goal of the data science week is to introduce interested students and staff to data science in a fun and cooperative way, and help create a community of data scientists at the University of Twente, the faculty of Behavioural and Management Sciences, and beyond. BDSi and DSI organize various events during the week, including a datathon, contextual speakers, expert lectures, hands-on workshops, and a networking drink.

During the lunch breaks expert data scientists will provide lectures on the most important tools in a data scientists’ toolbox; data wrangling, modelling, and communicating results. These lectures will be structured to support the datathon materials, but can be attended without participating in the datathon itself. The lectures are followed by a hands-on practical session in which the lunch lecturerer - supported by a team of motivated coaches - will guide participants in applying the lecture materials to their datathon submissions.

(Guest) speakers will be invited during the week to provide a deeper background in the topics and methods covered in the lectures and datathon, or to put these topics in a broader context. Throughout the week there will be ample time for socialization and networking, as well as a poster presentation session and networking drink on Thursday afternoon.

Signups are now open!

You can now sign up for the various lunch talks, the datathon, and the workshops. Places are limited, so sign up now!

Sign up now

Datathon

A datathon is an event in which teams collaborate and compete to create a solution to a shared problem. By learning from experts and peers and immediately applying your skills on a relevant and engaging real-world dataset, the BDSi datathons provide a great environment for both students and staff, beginners and experts to further hone their skills. For the spring 2025, we will have a new and rewarding data challenge.

Speakers

Lucas Noldus Radboud Universiteit
prof. dr. Lucas Noldus

Lucas Noldus is Professor of Behavior, Information Technology and Innovation at Donders Institute for Brain, Cognition and Behavior, Radboud University and CEO of Noldus Information Technology.

His research is aimed at the discovery and development of new techniques for automatic behavioral recording in animals and humans. The topics are quite diverse and include generic AI models for behavior recognition in rodents and human infants, vocalizations in mice, EEG and behavior in mice, learning tasks in zebrafish, eye tracking in MS patients, and motion analysis in visually impaired people. He tries to build bridges between the University and companies in the field of behavior and technology.

Matthijs Noordzij
prof. dr. Matthijs Noordzij

Matthijs Noordzij is Full Professor of Health Psychology and Persuasive Technology and directs the Health Dynamics & Self Management Lab at the University of Twente in The Netherlands.

His research and education focuses on exploring the scientific foundations and design principles for integrating sensor technology in (mental) healthcare and self-management.

His vision is to develop health technology that aligns with core human values such as compassion, while striving to create innovative solutions that enhance the way we interact with technology in healthcare settings.

Arlene John

Arlene John is an Assistant Professor at the Biomedical Signals and Systems (BSS) group at the University of Twente. She completed her PhD on data fusion frameworks for wearable health monitoring devices at University College Dublin, and has previously worked on Machine Learning Mathematics at Qualcomm and ASML.

Her current research interests include biomedical signal processing, machine learning and inference, explainable AI, and multisensor data fusion.

Ying Wang
dr. Ying Wang

Ying Wang is an Assistant Professor at the Biomedical Signals and Systems (BSS) group at the University of Twente.

Her research is interdisciplinary, and applies and develops multi-modal model-based signal processing, sensing and physiological system modeling techniques in the healthcare field. Her main research interest is remote continuous monitoring of individual’s physiological signs (such as, heart activty) and body movement in daily life for personalized disease prevention and management.

She is especially enthusiastic in using her expertise to tackle challenges surrounding the daily monitoring of physiological (brain and body) responses to dynamic physical activities for different healthcare purposes, such as, helping people stay in healthy and tracking patients' disease symptoms for disease management.

Stay tuned for updates!

We’re coordinating with speakers inside and outside the UT, and will update the website once more details are known.

Lectures & Practicals

Every lunch break (12:45 - 13:30, Tuesday - Friday) expert data scientists from BDSi and our partners will provide a lecture on the most important tools in a data scientists’ toolbox; data wrangling, modelling, and communicating results. These lectures will be structured to support the datathon materials, but can be attended without participating in the datathon itself.

After a short coffee break (13:30 - 13:45), the lecture will be followed by a hands-on practical session (13:45 - 15:30). During these two hours, the lecturer - supported by a team of motivated coaches - will support participants in applying the lecture materials to their datathon submissions. While these sessions are meant to accompany the days’ lecture, they can be attended by any datathon participants. Coaches will be on hand to answer any questions about the days’ lecture, the datathon, or data science in general.

Sabine Siesling speaking on (in)Equity in breast cancer care for the Women in Data Science Week 2024
Sabine Siesling speaking on (in)Equity in breast cancer care for the Women in Data Science Week 2024

Posters & Drinks

On thursday afternoon, we invite all data science week participants as well as anyone interested in data science at the University of Twente to join us for a poster presentation and drinks. This is a great opportunity to mingle with the other teams, and create lasting connections with peers and data science experts!

Women in Data Science Drinks & Poster Presentations 2024

Competition

The team with the best solution will receive the coveted BDSi Data Science trophy. All teams will also be asked to share their solutions, problems, and learning experience during the final presentations.

Who can join?

Staff, students, family, and friends

Everyone related to the University of Twente and their friends and family can join. You can join with friends, colleagues or even family. The event is open to both novices and experts, and everyone in between. You can join the datathon as a team, alone, or skip it altogether and only participate in the workshops. If you do join alone, you can choose to be assigned to a team with other data science enthusiasts, or go at it alone.

Some experience with R or Python

Some programming knowledge is required!

You'll need to have a basic idea of either R or Python in order to follow along with the lectures and practicals. Materials will always be prepared for R, and when possible for Python as well.

While we will do our best to introduce data science topics in the various workshops without relying on code, a basic understanding of R and/or Python will make it much easier to follow along.

If you have some experience with other programming languages, you should be able to follow along with a little preparation. More information on installing and using R can be found in the What can I do to prepare? section.

If you're new to programming in general or would like a deeper understanding of R, and would rather learn from one of our colleagues, the Cognition, Data and Education (CoDE) section provides courses and materials aimed at teaching staff and Johannes Steinrücke teaches half-day introduction to R and data visualization in R workshops for PhD's (and EngD's).

If you’re confident you can participate in the datathon in another programming language, you’re more than welcome to do so (we challenge you to try in C, Fortran, Brainf***, or JavaScript). Just be aware that we probably can’t offer support if or when you get stuck.

What can I do to prepare?

Get a team

First off, get a team together. The datathon is meant to be a collaborative experience where you work alongside a variety of expertises. While you can compete on your own, we strongly suggest working together.

Set up your coding environment

If you’re new to data science, you’ll want to set up a working environment. We recommend working in R or Python, depending on your experience.

Install R and RStudio, and prepare a working environment - Our colleague Johannes Steinrücke has written a good guide on how to set up R and RStudio for your projects, including some practical advice not covered in many other sources. The guide was written for students starting with coursework with R, but is equally applicable for other data science projects.

Install Python - The Women in Data Science team maintains a set of tutorials on installing Python (using Anaconda to manage packages and environments), Jupyter notebooks and the basics of Python data structures: https://github.com/keikokamei/WiDS_Datathon_Tutorials.

Further reading

If you’re looking for more information, a competitive edge, or just a good way to spend some time, we can recommend some more reading materials:

An Introduction to Statistical Learning is a free to download book providing an excellent introduction to practical machine learning using both R and Python.

R for Data Science is a free online book compiled by Hadley Wickham and a long list of community contributors, covering the whole gamut of modern data science in R. It is well worth a look, and a good reference even for experienced data scientists.

Kaggle.com provides resources to get started with Kaggle, as well as a long list of competitions that are approachable for beginners - with code and discussions available from hundreds of other participants. Trying your hand at a competition or two is a good way to spend a rainy weekend.

Sharada Kalanidhi has written an excellent deep dive into the 2023 WiDS datathon, including links to further resources for both R and Python: https://www.widsworldwide.org/get-inspired/blog/a-data-scientists-deep-dive-into-the-wids-datathon/.

Signups are now open!

You can now sign up for the various lunch talks, the datathon, and the workshops. Places are limited, so sign up now!

Sign up now

Schedule

The spring 2025 Data Science Week takes place from March 3rd to March 10th. There will be an opening lecture and talk introducing the topic, the schedule for the week, and the technicalities of competing in the datathon.

The rest of the week, there will be lunch lectures on core data science skills, followed by practicals applying those skills to the problem in the datathon. Invited speakers will be fit into the schedule based on availability and opportunity. The data science week closes on Monday the 10th with presentations from the competing datathon teams, and a final keynote.

Further details will be made available in the coming weeks and months.

Monday

March 3
Keynote: digital phenotyping
12:45 - 13:30, Location: Citadel T300
Lucas Noldus Radboud Universiteit
prof. dr. Lucas Noldus

Lucas Noldus is Professor of Behavior, Information Technology and Innovation at Donders Institute for Brain, Cognition and Behavior, Radboud University and CEO of Noldus Information Technology.

His research is aimed at the discovery and development of new techniques for automatic behavioral recording in animals and humans. The topics are quite diverse and include generic AI models for behavior recognition in rodents and human infants, vocalizations in mice, EEG and behavior in mice, learning tasks in zebrafish, eye tracking in MS patients, and motion analysis in visually impaired people. He tries to build bridges between the University and companies in the field of behavior and technology.

Keynote by prof. Lucas Noldus. Abstract will follow soon.

Datathon: kick-off
13:45 - 14:30, Location: Citadel T300
Anna Machens
Anna Machens
Karel Kroeze
Karel Kroeze

Introduction to the datathon, finding and matching teams.

Datathon: Gathering data
14:45 - 16:30, Location: Citadel T300
Anna Machens
Anna Machens
Karel Kroeze
Karel Kroeze

Digital phenotyping often relies on highly personalized data. What could be more personalized than working on data gathered by and from you and the other participants? We will use real physiological measurements in a simple and fun experimental design to gather data from participants, for participants.

Tuesday

March 4
Digital phenotyping in social sciences research (working title)
12:45 - 13:30, Location: Citadel T300
Matthijs Noordzij
prof. dr. Matthijs Noordzij

Matthijs Noordzij is Full Professor of Health Psychology and Persuasive Technology and directs the Health Dynamics & Self Management Lab at the University of Twente in The Netherlands.

His research and education focuses on exploring the scientific foundations and design principles for integrating sensor technology in (mental) healthcare and self-management.

His vision is to develop health technology that aligns with core human values such as compassion, while striving to create innovative solutions that enhance the way we interact with technology in healthcare settings.

Talk by Matthijs Noordzij. Abstract fill follow soon.

Workshop: Data Wrangling
13:45 - 15:30, Location: Citadel T300
Anna Machens
Anna Machens
Karel Kroeze
Karel Kroeze

Cleaning and combining datasets, (visually) exploring data and patterns, preparing raw data for further analysis.

Wednesday

March 5
Advancing Digital Phenotyping: From Physiological Time Series Data to Real-Life Multimodal Health Monitoring.
12:45 - 13:30, Location: Citadel T300
Arlene John

Arlene John is an Assistant Professor at the Biomedical Signals and Systems (BSS) group at the University of Twente. She completed her PhD on data fusion frameworks for wearable health monitoring devices at University College Dublin, and has previously worked on Machine Learning Mathematics at Qualcomm and ASML.

Her current research interests include biomedical signal processing, machine learning and inference, explainable AI, and multisensor data fusion.

Ying Wang
dr. Ying Wang

Ying Wang is an Assistant Professor at the Biomedical Signals and Systems (BSS) group at the University of Twente.

Her research is interdisciplinary, and applies and develops multi-modal model-based signal processing, sensing and physiological system modeling techniques in the healthcare field. Her main research interest is remote continuous monitoring of individual’s physiological signs (such as, heart activty) and body movement in daily life for personalized disease prevention and management.

She is especially enthusiastic in using her expertise to tackle challenges surrounding the daily monitoring of physiological (brain and body) responses to dynamic physical activities for different healthcare purposes, such as, helping people stay in healthy and tracking patients' disease symptoms for disease management.

This talk explores the journey from physiological time-series data to multimodal data analysis for digital phenotyping, emphasizing the transition from controlled semi-lab environments to real-life health monitoring. The challenges and some innovations in daily-life health monitoring required to sense information unobtrusively to enable the development of personalized phenotypes for continuous health tracking is discussed. Key topics include both wireless and wearable sensing techniques, multimodal feature extraction, identifying interrelationships amongst features, and connecting these insights to individual phenotypes.
Additionally, we examine methods for monitoring health trends over extended periods. Practical applications discussed will include energy expenditure monitoring during daily physical activity for people with risk of obesity, cardiac function monitoring for people with long term diabetes, psychophysiological condition monitoring for people with knee osteoarthritis, recovery tracking post-colorectal surgery using patch sensors, and smartphone-based digital phenotyping for breast cancer survivors.

Workshop: Feature Engineering
13:45 - 15:30, Location: Citadel T300
Anna Machens
Anna Machens
Karel Kroeze
Karel Kroeze

How to go from ‘raw’ sensor data to usable signals or ‘features’. We will provide some background and context for the choices you will have to make (or that are made for you with commercial products), and implement a basic pipeline for extracting features from raw signals data in R.

Thursday

March 6
(to be announced…)
12:45 - 13:30, Location: Citadel T300
Workshop: Modelling
13:45 - 15:30, Location: Citadel T300
Anna Machens
Anna Machens

Introduction to modelling and machine learning in R.

Posters & Drinks
16:00 - 18:00, Location: TBA

(Social) networking with other participants, and other University of Twente students and staff interested in data science.

Friday

March 7
(to be announced…)
12:45 - 13:30, Location: Citadel T300
Workshop: Modelling II
13:45 - 15:30, Location: Citadel T300
Anna Machens
Anna Machens
Karel Kroeze
Karel Kroeze

More in-depth continuation of modelling, implications of different modeling choices and ‘fairness’ in AI models. [Note: exact contents may still change]

Sunday

March 9
Datathon: Submission deadline
23:59 - , Location:

Deadline for datathon submissions on the leaderboard.

Monday

March 10
Datathon: Team presentations
12:45 - 13:30, Location: Citadel T300

Presentations by the datathon participants. Solutions, challenges, and lessons learned.

Datathon: Prize ceremony
13:45 - 14:00, Location: Citadel T300

Prizes for the best and most innovative teams.

Final keynote
14:00 - 14:30, Location: Citadel T300

Final keynote speech and official closing of the Data Science Week.

Signups are now open!

You can now sign up for the various lunch talks, the datathon, and the workshops. Places are limited, so sign up now!

Sign up now