Master degree program
Data Science

Data Science

QUALIFICATION

  • Scientific and pedagogical direction - Master of Engineering Sciences

MODEL OF GRADUATING STUDENT

1. Apply methods for collecting, preprocessing, visualizing data from heterogeneous sources to get an idea of the subject area under study, identify patterns and support decision-making based on data analysis.
2. Apply the methods of statistical analysis, linear algebra, optimization, mathematical analysis and computational tools necessary to effectively extract useful information from structured and unstructured data sets of any size.
3. Develop data processing applications, implement basic computational algorithms for data analysis, evaluate the computational complexity of algorithms, design and use relational and non-relational databases, and carry out practical data analysis projects in collaboration with industry partners.
4. Organize, visualize and analyze large complex datasets using descriptive statistics methods, develop big data management applications in various fields, develop, install and configure cloud computing applications, and apply virtual machine computing environments for scalable data processing.
5. Explore various use cases for blockchain technology in various industries, design and develop decentralized applications based on blockchain technology, take into account ethical issues, analyze the potential consequences of using blockchain for society and the economy.
6. Develop and optimize machine learning models and methods for data analysis and visualization in solving applied problems, apply deep learning models in scientific research, innovative projects and real applications.
7. Analyze data privacy issues, comply with ethical standards, privacy principles and data security measures related to the collection, analysis and use of data in various contexts, apply technical mechanisms to ensure data security and confidentiality.
8. Conduct an in-depth analysis of the research area to select suitable data analysis methods, use knowledge and skills to continue learning and adapt to new data processing technologies, develop critical thinking about data and data analysis-based decisions, lead a research team.
9. Independently conduct scientific research, understand current research issues, analyze and critically relate to various sources of information, use them to structure and formulate reasoning, conduct scientific and pedagogical activities, implement research results in practical pedagogical activities.
10. Apply methods and tools for data analysis in various multidisciplinary areas, present research results in various forms in national scientific publications, at conferences, taking into account the specifics of the audience, be tolerant, work effectively in a team when searching for and solving research problems.

Program passport

Speciality Name
Data Science
Speciality Code
7M06115
Faculty
Information technology

disciplines

Applied Machine Learning
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to develop the ability to apply machine learning algorithms and methods for analyzing and visualizing data in solving applied problems. Within the framework of the discipline, the following aspects are considered: Data preprocessing and feature engineering. Methods for Feature Selection and Dimension Reduction. Supervised learning algorithms. Learning algorithms without a teacher. Model evaluation and validation. Evaluation metrics for classification, regression and clustering. Cross validation and hyperparameter tuning. Advanced machine learning methods. Practical applications and case studies

Big Data Management
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of this discipline is to develop the ability to manage big data, design, and implement tools necessary for working with large datasets. Content of the discipline: Big data ecosystem. Big data analysis tools. Distributed database management systems. Designing big data management systems. NoSQL databases. Key-value stores. Distributed file systems. Distributed data processing. Stream management and processing.

Database design and management
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to form the ability to design and use relational and non-relational databases, build complex reports and conduct deep data analysis through queries. Within the framework of the discipline the following aspects are considered: Relational data model. Analysis and design, implementation of a relational database using SQL. Designing NoSQL databases. Prepare, explore and extract data from a NoSQL database.

Ethics, privacy and data security
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to develop the ability to comply with ethical standards, confidentiality principles and data security measures related to the collection, analysis and use of data in various contexts Content of the discipline: Ethical standards in handling data. Data protection rules. Ways to keep data confidential. Encryption/decryption, message authentication, data integrity. Threat models for data-driven applications. Public Key Cryptography for Data Protection

Foreign Language (professional)
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to form practical skills in various types of speech activity in a foreign language. The training course builds the ability to perceive, understand and translate information in the modern global space, participate in scientific events to test their own research. The discipline is aimed at improving competencies in accordance with international standards of foreign language education.

Generative Artificial Intelligence: Technologies and Applications
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The course aims to introduce master students to the fundamentals and advanced techniques of generative models and their practical applications in various domains such as image creation, text generation and voice synthesis. Objectives include learning generative algorithms and creating projects using generative artificial intelligent technologies.

History and Philosophy of Science
  • Number of credits - 3
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to be considered on the basis of historical dynamics and in a historically changing socio - cultural context. Introduces the problems of the phenomenon of Science, which is a subject of special philosophical analysis, forms knowledge about the history and theory of Science, the laws of the development of Science and the structure of scientific knowledge, the features of science as a specialty and social institution, the role of Science in the development of society.

Mathematical Methods in Data Science
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of the discipline is to develop the ability to apply mathematical methods, including probability, linear algebra, calculus and optimization, ordinary differential equations and partial differential equations, in data science. Course content: Linear least squares and singular value decomposition. Spectral Graph Theory and Applications. Optimization in machine learning. Optimal conditions. Convex. Gradient Descent: Convergence Analysis. back propagation. Stochastic gradient descent for logistic regression. Probabilistic models: key concepts and examples.

Organization and Planning of Scientific Research (in English)
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline to form the ability to apply practical skills in the organization and planning of scientific research. The discipline studies: forms and methods of planning, organization and design of scientific articles and dissertations; forms of summarizing the results of scientific research in presentations, speeches, projects, articles.

Pedagogy of Higher education
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose is the formation of the ability of pedagogical activity through the knowledge of higher education didactics, theories of upbringing and education management, analysis, and self-assessment of teaching activities. The course covers the educational activity design of specialists, Bologna process implementation, acquiring a lecturer, and curatorial skills by TLA-strategies.

Programming for Data Science
  • Number of credits - 5
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of this discipline is to develop the ability to create applications for data processing and manage software development projects. Course content: Software development life cycle. Preprocessing of raw data. Key stages of creating a data analysis pipeline. Data processing. Methods of knowledge extraction from data. Data visualization. Interacting with machine learning frameworks. Developing software for data analytics. Parallelism in Python. Using Git and GitHub.

Psychology of Management
  • Number of credits - 3
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The course reveals the subject, the basic principles of management psychology, personality in managerial interactions, personal behavior management, psychology of managing group phenomena and processes, psychological characteristics of the leader's personality, individual management style, psychology of influence in management activities, conflict management.

Data for 2021-2024 years

disciplines

Artificial Intelligence in Natural Language Processing (NLP)
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to teach master students the fundamentals and advanced techniques of NLP so that they can design and implement intelligent systems for text processing. Objectives include the study of language representation models, sentiment analysis, machine translation, and other applications of NLP.

Basics of Blockchain Engineering
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the discipline is to develop the ability to develop and deploy blockchain applications using Solidity for various blockchain platforms. Course content: Introduction to blockchain technology. Principles of operation of distributed registries. Consensus mechanisms used in blockchain. Creation and deployment of smart contracts. The process of creating and deploying blockchain applications. Basic principles of cryptography used in blockchain.

Big Data Management
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of this discipline is to develop the ability to manage big data, design, and implement tools necessary for working with large datasets. Content of the discipline: Big data ecosystem. Big data analysis tools. Distributed database management systems. Designing big data management systems. NoSQL databases. Key-value stores. Distributed file systems. Distributed data processing. Stream management and processing.

Blockchain Bisiness Models
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the course is to teach undergraduates the basics and practical aspects of business models related to blockchain technology. Content: Blockchain-based business models. Decentralized platforms, markets, supply chain management, financial services and other applications. Blockchain economics. Economic aspects of the blockchain, economic incentives and rewards, voting and decision-making mechanisms based on the blockchain. Financial models and business models. Regulation and legal aspects. The use of blockchain in various industries.

Blockchain systems architecture
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the discipline is to develop the ability to understand the basics of the blockchain architecture and its components, to apply decentralized networks and consensus mechanisms in the deployment and implementation of the blockchain. Contents: Basic concepts of the blockchain. Types of blockchain systems. Advantages and disadvantages. Architectural components of the blockchain. Protocols and consensus algorithms. The principles of operation of the Proof of Work (PoW), Proof of Stake (PoS), Delegated Proof of Stake (DPoS) algorithms and others. Scalability and performance. blockchain systems. Interoperability and standards. blockchain systems. Standards and protocols ERC-20, ERC-721, Hyperledger Fabric and others

Cloud Computing for Data Science
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the discipline is to develop the ability to apply the MapReduce model and virtual machine computing environments for scalable data processing. Course content: Parallel programming using MapReduce. Clouds with infrastructure, platform and software as a service. Technologies and tools of virtualization. Cloud data storage. NoSQL databases and parallel query processing. Processing streaming data.

Cognitive Systems and Neural Networks
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of the course is to familiarize master students with the principles of neural networks and their application in various cognitive systems. The objectives include the study of basic algorithms and models of neural networks, as well as their application in signal processing, reinforcement learning and other areas of artificial intelligence.

Data Collection, Visualization and Analysis
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to develop the ability to apply effective data collection strategies, data visualization, analysis and interpretation techniques to obtain meaningful conclusions in data-driven decision making. Course content: Data collection methods. Principles and tools of data visualization. Exploratory data analysis. Methods of statistical analysis. Analysis and interpretation of data. Data mining. Predictive modeling and machine learning methods for data-driven decision making. ethical awareness. Data security and privacy

Data Engineering
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to develop the ability to apply effective data collection strategies, data visualization, analysis and interpretation techniques to obtain meaningful conclusions in data-driven decision making. Course content: Data collection methods. Principles and tools of data visualization. Exploratory data analysis. Methods of statistical analysis. Analysis and interpretation of data. Data mining. Predictive modeling and machine learning methods for data-driven decision making. ethical awareness. Data security and privacy

Data Science Applications for Industry (project)
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of the discipline is to develop the ability to carry out practical data analysis projects in collaboration with industry partners. Content of discipline: Research methods. Literature analysis. Scope estimation and project planning. Collection and pre-processing of data. Exploratory data analysis. Choice, development of data analysis algorithm. Development and selection of functions, construction and evaluation of models. Interpretation and visualization of results. Presentation of the results of data analysis and preparation of a report.

Decentralized applications
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to form the ability to design, develop, deploy and test decentralized applications based on blockchain technology. Contents: Fundamentals of decentralization. The concept of decentralization and various approaches to its implementation in applications. Blockchain technology. Basic principles of blockchain technology and its role in decentralized applications. Ethereum, EOS, NEO decentralized application platforms and others. Development of decentralized applications. Methods and tools for developing decentralized applications.

Deep Learning
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The purpose of the discipline is to develop the ability to use the methods of deep neural networks for the analysis of big data. Course content: The place of deep learning in the context of statistics and machine learning. Training and validation of deep models. Deep learning models and their applications. Architecture of deep neural networks. Setting hyperparameters. Convolutional neural networks. Recurrent neural networks. Acceleration of training of convolutional neural networks.

Deep Learning in Computer Vision
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to train master students in deep machine learning methods specific to image processing and analysis. Objectives include the study of basic deep learning algorithms and models, and their application to pattern recognition, image segmentation, and other computer vision problems.

Introduction to web3
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of the course is to develop the ability to understand the basic principles of decentralization and apply web3 technologies to create various types of decentralized applications. The course covers the following aspects: Web3 and decentralization. Ethereum and smart contracts. The basics of smart contracts and their role in the development of decentralized applications on Ethereum. Web3.js and other libraries. Integration with web applications. Methods for integrating decentralized functions into web applications

Machine Learning and Data Analytics
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to provide master students with fundamental knowledge and skills in machine learning and data analytics to effectively analyze, interpret, and utilize large amounts of data. Objectives include learning the basic techniques of machine learning, data analytics and their practical application in solving real-world problems.

Programming for Data Science
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of this discipline is to develop the ability to create applications for data processing and manage software development projects. Course content: Software development life cycle. Preprocessing of raw data. Key stages of creating a data analysis pipeline. Data processing. Methods of knowledge extraction from data. Data visualization. Interacting with machine learning frameworks. Developing software for data analytics. Parallelism in Python. Using Git and GitHub.

Reinforcement Learning
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to teach master students the basic concepts and algorithms of reinforcement learning to create autonomous agents capable of making decisions in dynamic environments. The objectives include learning the theory of reinforcement learning, implementation of the algorithms and their application in various practical scenarios.

Smart contracts architecture
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The goal of the course is to teach undergraduates the basics of smart contract architecture and provide them with an understanding of the principles and methods for designing efficient and reliable smart contracts. The course covers the following aspects: Introduction to smart contracts. The concept of smart contracts and their role in blockchain technology. Basic principles of smart contracts, their structure and main functions. Programming languages for smart contracts. Smart contract architecture. Design principles and architecture of smart contracts. Modularity, inheritance, interfaces and other aspects for the development of flexible and efficient smart contracts. State management and data storage. Data structure and basic operations for storing and modifying data in smart contracts.

Statistics for data science
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The aim of this discipline is to develop the ability to apply statistical procedures for data analysis using programming languages. Content of the discipline: Data distributions and sampling. Confidence intervals. Hypothesis testing. Statistical experiments and significance testing. Regression and prediction. Linear regression models, analysis of variance. Classification. Statistical machine learning. Unsupervised learning.

Theoretical Foundations of Artificial Intelligence
  • Type of control - [RK1+MT+RK2+Exam] (100)
  • Description - The course aims to provide master students with a thorough understanding of the key concepts and theoretical foundations of artificial intelligence. Objectives include the study of basic algorithms, models and principles, and their application to the creation of intelligent systems in various domains such as natural language processing and computer vision.

Data for 2021-2024 years

INTERNSHIPS

Pedagogical
  • Type of control - Защита практики
  • Description - Formation of practical, educational-methodical skills of conducting lectures, seminars, creatively apply scientific, theoretical knowledge, practical skills in teaching activities, conduct training sessions in the disciplines of the specialty; own modern professional techniques, methods of training, use in practice the latest theoretical, methodological advances, make educational, methodological documentation.

Research
  • Type of control - Защита практики
  • Description - The purpose of the practice: gaining experience in the study of an actual scientific problem, expand the professional knowledge gained in the learning process, and developing practical skills for conducting independent scientific work. The practice is aimed at developing the skills of research, analysis and application of economic knowledge.

Data for 2021-2024 years