Parallel and Distributed Computing II весна 2025 — различия между версиями

Материал из Public ATP Wiki
Перейти к: навигация, поиск
(Полезные ссылки)
 
(не показана 1 промежуточная версия этого же участника)
Строка 1: Строка 1:
==Общие сведения==
+
==Information about course==
'''Преподаватель'''
+
This is one-term course designed for third-year undergraduate students pursuing a Bachelor's degree in Computer Science. It provides a comprehensive introduction to the principles and practices of distributed computing, with a specific focus on Big Data analytics and engineering.
*Иванова Юлия
+
 
==Полезные ссылки==
+
 
[https://t.me/+9pSPd59Dcr1iMTI6 Телеграм чат]
+
Prerequisites for this course include a foundational knowledge in Computer Science, proficiency in Python programming, familiarity with Linux environments, basic bash command usage, and experience with Git.
[https://docs.google.com/forms/d/e/1FAIpQLSciqI8bL8XgNK8Z7PDYNlh3-JORsz0h0al_8QY_hK2okCfCfA/viewform?usp=header Регистрация на курс]
+
 
 +
 
 +
Throughout the term, students will study key distributed computing frameworks and technologies such as HDFS (Hadoop Distributed File System), MapReduce, Hive, Apache Spark, and Spark Streaming. The course is structured to provide a balance of theoretical knowledge and practical application, with students gaining direct experience by working on the university's dedicated Hadoop cluster. To access the cluster for course-related projects and hands-on learning, students are required to fill out a [https://docs.google.com/forms/d/e/1FAIpQLSciqI8bL8XgNK8Z7PDYNlh3-JORsz0h0al_8QY_hK2okCfCfA/viewform form] to obtain an account.
 +
 
 +
 
 +
Upon completion, students will be well-prepared to tackle complex data processing tasks and work effectively in the undustry of Big Data analytics and engineering.
 +
 
 +
==Grading==
 +
[https://docs.google.com/spreadsheets/d/1V9o8icLtAnGIvn-KYLk2jIciWcmZjfYNaYRSx_5HBLo/edit?usp=sharing Grading criteria]
 +
 
 +
==Links==
 +
*[https://t.me/+9pSPd59Dcr1iMTI6 Телеграм чат]
 +
*[https://docs.google.com/forms/d/e/1FAIpQLSciqI8bL8XgNK8Z7PDYNlh3-JORsz0h0al_8QY_hK2okCfCfA/viewform?usp=header Регистрация на курс]
 +
*[https://gitlab.com/pd2020-supplementary/foreigners/-/tree/master?ref_type=heads Seminar materials and homeworks]
 +
 
 +
==Contacts==
 +
Teacher:
 +
Julia Ivanova
 +
tg: @lajulienn

Текущая версия на 12:03, 13 февраля 2025

Information about course

This is one-term course designed for third-year undergraduate students pursuing a Bachelor's degree in Computer Science. It provides a comprehensive introduction to the principles and practices of distributed computing, with a specific focus on Big Data analytics and engineering.


Prerequisites for this course include a foundational knowledge in Computer Science, proficiency in Python programming, familiarity with Linux environments, basic bash command usage, and experience with Git.


Throughout the term, students will study key distributed computing frameworks and technologies such as HDFS (Hadoop Distributed File System), MapReduce, Hive, Apache Spark, and Spark Streaming. The course is structured to provide a balance of theoretical knowledge and practical application, with students gaining direct experience by working on the university's dedicated Hadoop cluster. To access the cluster for course-related projects and hands-on learning, students are required to fill out a form to obtain an account.


Upon completion, students will be well-prepared to tackle complex data processing tasks and work effectively in the undustry of Big Data analytics and engineering.

Grading

Grading criteria

Links

Contacts

Teacher: Julia Ivanova tg: @lajulienn