Big Data Processing on HPC
- Date
- Nov 7, 2024
- Time
- 10:00 AM - 3:00 PM
- Speaker
- Apurv Deepak Kulkarni, Wenyu Zhang
- Affiliation
- ScaDS.AI Dresden/Leipzig
- Series
- ScaDS.AI Training
- Language
- English
- Main Topic
- Computer Science
- Description
- Apache Spark and Apache Flink are two widely used Big Data analytics frameworks. Their APIs allow an application to be developed and tested on a local workstation and later, without changes to its source code, to distribute the work across many computers once the workstation's resources no longer suffice. The course Big Data Processing on HPC focuses on the step from a local workstation to an HPC environment and shows how a typical Big Data analysis workflow can be organized there. Participants will be introduced to running a data pipeline, processing data, and managing the configuration in the HPC environment, using Apache Flink and Apache Spark.
Last modified: Sep 24, 2024, 3:55:11 PM
Location
Online. The access link will be announced after registration.
Organizer
Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Chemnitzer Straße 46b, 2. OG, 01187 Dresden
- Phone
- +49 351 463-40900
- ScaDS.AI
- Homepage
- https://scads.ai