Data Science

Professors

Soutir Bandyopadhyay, Applied Mathematics and Statistics

Dorit Hammerling, Applied Mathematics & Statistics

Teaching Professor

Wendy Fisher, Computer Science

Teaching Assistant Professors

Nathan Lenssen, Applied Mathematics and Statistics

Zibo Wang, Computer Science

Research Associate Professor

Zane Jobe, Geology and Geological Engineering

Professor Emeritus

Douglas Nychka, Applied Mathematics & Statistics

Degrees Offered

 

Data Science Master of Science (Non-Thesis) 

The field of Data Science draws on elements of computer science, statistics and interdisciplinary applications to address the unique needs of gaining knowledge and insight through data analysis. This Masters Non-Thesis program is designed to give candidates a foundation in statistics and computer science and also provide knowledge in a particular application domain of science or engineering. The balance between these three elements is a strength of the program and can prepare candidates for Data Science careers in industry, government, or for further study at the PhD level. Moreover, the coursework will be flexible and tailored to each candidate. For example, the program will allow a candidate to increase his/her skills in data analytics while developing a focused area of application or alternatively allow a candidate with depth in an area of application to gain skills in statistics and computer science.

Program Requirements

Students take 18 credits of core courses, 3 credits of professional development coursework and at least 9 credits of electives for a minimum total of 30 credits. 

Core Courses
Data Modeling and Statistical Learning
MATH530INTRODUCTION TO STATISTICAL METHODS (Online Only) Students who have passed MATH335 should replace MATH530 with any of the following: MATH500, MATH531, MATH532, MATH533, MATH534, MATH535, MATH536, MATH537, MATH538, or MATH5513.0
MATH560INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I (Online Only)3.0
MATH561INTRODUCTION TO KEY STATISTICAL LEARNING METHODS II (Online Only)3.0
Machine Learning, Data Processing and Algorithms, and Parallel Computation
DSCI503ADVANCED DATA SCIENCE3.0
DSCI570INTRODUCTION TO MACHINE LEARNING3.0
DSCI575ADVANCED MACHINE LEARNING3.0
Professional Development Requirement

Choose courses from the table below in any combination to meet or exceed 3 credits.

Professional Development
SYGN502INTRODUCTION TO RESEARCH ETHICS1.0
SYGN5XXLEADERSHIP AND TEAMWORK1.0
SYGN683ORAL COMMUNICATION SKILLS1.0
SYGN684WRITING SKILLS2.0
LICM501PROFESSIONAL ORAL COMMUNICATION1.0
EBGN562STRATEGIC DECISION MAKING3.0
EBGN563MANAGEMENT OF TECHNOLOGY AND INNOVATION3.0
EBGN566TECHNOLOGY ENTREPRENEURSHIP3.0
EBGN577LEADING & MANAGING HIGH PERFORMING TEAMS3.0
PHGN503RESPONSIBLE CONDUCT OF RESEARCH1.0
SYGN598SPECIAL TOPICS (Graduate Internship)1-3
Electives

Electives are designed by the student to fulfill their interests and career goals. The intent is to give you the knowledge and skills to apply data science techniques in various fields. Although this individualized module can draw on graduate courses from across the university the courses given below are approved options:

BIOL510BIOINFORMATICS3.0
CBEN505NUMERICAL METHODS IN CHEMICAL ENGINEERING3.0
CBEN507APPLIED MATHEMATICS IN CHEMICAL ENGINEERING3.0
CBEN509ADVANCED CHEMICAL ENGINEERING THERMODYNAMICS3.0
CBEN511NEUROSCIENCE, MEMORY, AND LEARNING3.0
CBEN516ADVANCED TRANSPORT PHENOMENA3.0
CBEN518REACTION KINETICS AND CATALYSIS3.0
CBEN530TRANSPORT PHENOMENA3.0
CBEN532TRANSPORT PHENOMENA IN BIOLOGICAL SYSTEMS3.0
CBEN624APPLIED STATISTICAL MECHANICS3.0
CBEN625MOLECULAR SIMULATION3.0
CEEN505NUMERICAL METHODS FOR ENGINEERS3.0
CEEN506FINITE ELEMENT METHODS FOR ENGINEERS3.0
CEEN519RISK ASSESSMENT IN GEOTECHNICAL ENGINEERING3.0
CEEN530ADVANCED STRUCTURAL ANALYSIS3.0
CEEN531STRUCTURAL DYNAMICS3.0
CEEN533MATRIX STRUCTURAL ANALYSIS3.0
CEEN546STATISTICAL METHODS FOR RELIABILITY AND ENGINEERING DESIGN3.0
CEEN566MICROBIAL PROCESSES, ANALYSIS AND MODELING3.0
CHGC504METHODS IN GEOCHEMISTRY3.0
CHGN507ADVANCED ANALYTICAL CHEMISTRY3.0
CHGN583PRINCIPLES AND APPLICATIONS OF SURFACE ANALYSIS TECHNIQUES3.0
CSCI507INTRODUCTION TO COMPUTER VISION3.0
CSCI532ROBOT ETHICS3.0
CSCI536HUMAN-ROBOT INTERACTION3.0
CSCI544ADVANCED COMPUTER GRAPHICS3.0
CSCI555GAME THEORY AND NETWORKS3.0
CSCI561THEORY OF COMPUTATION3.0
CSCI562APPLIED ALGORITHMS AND DATA STRUCTURES3.0
CSCI563PARALLEL COMPUTING FOR SCIENTISTS AND ENGINEERS3.0
CSCI564ADVANCED COMPUTER ARCHITECTURE3.0
CSCI565DISTRIBUTED SYSTEMS3.0
CSCI571ARTIFICIAL INTELLIGENCE3.0
CSCI572COMPUTER NETWORKS II3.0
CSCI574THEORY OF CRYPTOGRAPHY3.0
CSCI575ADVANCED MACHINE LEARNING3.0
CSCI576DEEP LEARNING3.0
CSCI577ADVANCED ELEMENTS OF GAMES AND GAME DEVELOPMENT3.0
CSCI578BIOINFORMATICS3.0
CSCI581QUANTUM PROGRAMMING3.0
CSCI582COMPUTING BEYOND CPU'S3.0
CSCI583IOT SECURITY AND PRIVACY 3.0
CSCI585INFORMATION SECURITY PRIVACY3.0
EBGN509MATHEMATICAL ECONOMICS3.0
EBGN525BUSINESS ANALYTICS3.0
EBGN526STOCHASTIC MODELS IN MANAGEMENT SCIENCE3.0
EBGN527BUSINESS OPTIMIZATION MODELS3.0
EBGN528INDUSTRIAL SYSTEMS SIMULATION3.0
EBGN547FINANCIAL RISK MANAGEMENT3.0
EBGN559SUPPLY CHAIN ANALYTICS3.0
EBGN560DECISION ANALYTICS3.0
EBGN570ENVIRONMENTAL ECONOMICS3.0
EBGN571MARKETING ANALYTICS3.0
EBGN590ECONOMETRICS I3.0
EBGN594TIME-SERIES ECONOMETRICS3.0
EBGN645COMPUTATIONAL ECONOMICS3.0
EENG507INTRODUCTION TO COMPUTER VISION3.0
EENG509SPARSE SIGNAL PROCESSING3.0
EENG511CONVEX OPTIMIZATION AND ITS ENGINEERING APPLICATIONS3.0
EENG514DATA SCIENCE FOR ELECTRICAL ENGINEERING3.0
EENG515MATHEMATICAL METHODS FOR SIGNALS AND SYSTEMS3.0
EENG519ESTIMATION THEORY AND KALMAN FILTERING3.0
EENG521NUMERICAL OPTIMIZATION3.0
EENG528COMPUTATIONAL ELECTROMAGNETICS3.0
EENG581POWER SYSTEM OPERATION AND MANAGEMENT3.0
EENG584POWER SYSTEM RISK MANAGEMENT3.0
GEGN532GEOLOGICAL DATA ANALYSIS3.0
GEGN568POINT CLOUD DATA ANALYSIS IN EARTH SCIENCE AND ENGINEERING3.0
GEGN579PYTHON SCRIPTING FOR GEOGRAPHIC INFORMATION SYSTEMS3.0
GEGN583MATHEMATICAL MODELING OF GROUNDWATER SYSTEMS3.0
GEGN586NUMERICAL MODELING OF GEOCHEMICAL SYSTEMS3.0
GEGN671LANDSLIDES: INVESTIGATION, ANALYSIS & MITIGATION3.0
GEOL527SWIR (SHORT WAVELENGTH INFRA-RED) SPECTRAL ANALYSIS1.0
GEOL613GEOLOGIC RESERVOIR CHARACTERIZATION3.0
GPGN658SEISMIC WAVEFIELD IMAGING3.0
HASS578GLOBAL ENVIRONMENTAL ISSUES3.0
MATH500LINEAR VECTOR SPACES3.0
MATH501APPLIED ANALYSIS3.0
MATH506COMPLEX ANALYSIS II3.0
MATH510ORDINARY DIFFERENTIAL EQUATIONS AND DYNAMICAL SYSTEMS3.0
MATH514APPLIED MATHEMATICS I3.0
MATH515APPLIED MATHEMATICS II3.0
MATH531THEORY OF LINEAR MODELS3.0
MATH532SPATIAL STATISTICS3.0
MATH533 TIME SERIES ANALYSIS AND ITS APPLICATIONS3.0
MATH534MATHEMATICAL STATISTICS I3.0
MATH535MATHEMATICAL STATISTICS II3.0
MATH536ADVANCED STATISTICAL MODELING3.0
MATH537MULTIVARIATE ANALYSIS3.0
MATH538STOCHASTIC MODELS3.0
MATH540PARALLEL SCIENTIFIC COMPUTING3.0
MATH551COMPUTATIONAL LINEAR ALGEBRA3.0
MATH552KERNEL-BASED APPROXIMATION METHODS3.0
MATH557INTEGRAL EQUATIONS3.0
MATH559ASYMPTOTICS3.0
MATH572MATHEMATICAL AND COMPUTATIONAL NEUROSCIENCE3.0
MATH582STATISTICS PRACTICUM3.0
MEGN502ADVANCED ENGINEERING ANALYSIS3.0
MEGN535MODELING AND SIMULATION OF HUMAN MOVEMENT3.0
MEGN536COMPUTATIONAL BIOMECHANICS3.0
MEGN544ROBOT MECHANICS: KINEMATICS, DYNAMICS, AND CONTROL3.0
MEGN545ADVANCED ROBOT CONTROL3.0
MEGN553COMPUTATIONAL FLUID DYNAMICS3.0
MEGN554ORBITAL MECHANICS3.0
MEGN571ADVANCED HEAT TRANSFER3.0
MEGN586LINEAR OPTIMIZATION3.0
MEGN587NONLINEAR OPTIMIZATION3.0
MEGN588INTEGER OPTIMIZATION3.0
MEGN592RISK AND RELIABILITY ENGINEERING ANALYSIS AND DESIGN3.0
MEGN686ADVANCED LINEAR OPTIMIZATION3.0
MEGN688ADVANCED INTEGER OPTIMIZATION3.0
MNGN502GEOSPATIAL BIG DATA ANALYTICS3.0
PEGN501APPLICATIONS OF NUMERICAL METHODS TO PETROLEUM ENGINEERING3.0
PEGN513RESERVOIR SIMULATION I3.0
PEGN590RESERVOIR GEOMECHANICS3.0
PEGN624COMPOSITIONAL MODELING - APPLICATION TO ENHANCED OIL RECOVERY3.0
PHGN511MATHEMATICAL PHYSICS3.0
PHGN519FUNDAMENTALS OF QUANTUM INFORMATION3.0
PHGN520QUANTUM MECHANICS I3.0
PHGN521QUANTUM MECHANICS II3.0
PHGN530STATISTICAL MECHANICS3.0
ROBO517INTRODUCTION TO COMPUTER VISION3.0
ROBO550MECHATRONICS3.0
ROBO554ROBOT MECHANICS: KINEMATICS, DYNAMICS, AND CONTROL3.0
ROBO565ADVANCED ROBOT CONTROL3.0
ROBO572ROBOT ETHICS3.0
ROBO576HUMAN-ROBOT INTERACTION3.0

Sample Course Schedule

First Year
Fallleclabsem.hrs
DSCI503ADVANCED DATA SCIENCE  3.0
DSCI570INTRODUCTION TO MACHINE LEARNING  3.0
MATH530INTRODUCTION TO STATISTICAL METHODS  3.0
ELECT Elective   3.0
12.0
Springleclabsem.hrs
DSCI575ADVANCED MACHINE LEARNING  3.0
MATH560INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I  3.0
ELECT Elective  3.0
LICM501PROFESSIONAL ORAL COMMUNICATION  1.0
10.0
Second Year
Fallleclabsem.hrs
MATH561INTRODUCTION TO KEY STATISTICAL LEARNING METHODS II  3.0
ELECT Elective  3.0
SYGN502INTRODUCTION TO RESEARCH ETHICS  1.0
SYGN5XX LEADERSHIP AND TEAMWORK  1.0
8.0
Total Semester Hrs: 30.0

Mines Combined Undergraduate / Graduate Degree Program 

Students enrolled in Mines’ combined undergraduate/graduate program may double count up to six credits of graduate coursework to fulfill requirements of both their undergraduate and graduate degree programs. These courses must have been passed with “B-” or better, not be substitutes for required coursework, and meet all other University, Department, and Program requirements for graduate credit.

Students are advised to consult with their undergraduate and graduate advisors for appropriate courses to double count upon admission to the combined program.

 

Master of Data Science (Non-Thesis) Online

The Master of Data Science (Non-Thesis) program is designed to give candidates a foundation in statistics and computer science and also provide knowledge in a particular application domain of science or engineering. The balance between these three elements is a strength of the program and can prepare candidates for Data Science careers in industry, government, or for further study at the PhD level. Throughout is an emphasis on working in teams, creative problem solving, and professional development.

This non-thesis master’s program gives students a foundation in statistics and computer science, while also providing knowledge in a particular application domain of science or engineering. It is pitched a higher level of statistics and computer science than one would encounter in a typical data analytics curriculum.  

The balance between these elements is a strength of the program and prepares students for data science careers in industry, government or for further study at the PhD level. The emphasis on some foundational knowledge will prepare students to be more innovative in their approach to data analysis and not rely on simply using software packages in a standard way.  Moreover, the three elective courses can be tailored to each student's interests. This program allows students to either increase their skill in data analytics while developing a focused area of application or alternatively to allow a student with depth in one area of application to gain skills in statistics and computer science.

Program

Students take 18 credits of core courses, 3 credits of professional development coursework and at least 9 credits of electives for a minimum total of 30 credits. 

Core Courses

MATH530INTRODUCTION TO STATISTICAL METHODS Students who have passed MATH335 should replace MATH530 with any of the following: MATH500, MATH531, MATH532, MATH533, MATH534, MATH535, MATH536, MATH537, MATH538, or MATH5513.0
MATH560INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I3.0
MATH561INTRODUCTION TO KEY STATISTICAL LEARNING METHODS II3.0
DSCI503ADVANCED DATA SCIENCE3.0
DSCI570INTRODUCTION TO MACHINE LEARNING3.0
DSCI575ADVANCED MACHINE LEARNING3.0
Total Semester Hrs18.0

Professional Development Requirement

Choose courses from the table below in any combination to meet or exceed 3 credits.

EBGN562STRATEGIC DECISION MAKING3.0
EBGN577LEADING & MANAGING HIGH PERFORMING TEAMS3.0
EBGN563MANAGEMENT OF TECHNOLOGY AND INNOVATION3.0
SYGN598SPECIAL TOPICS (Graduate Internship)1-3
SYGN599INDEPENDENT STUDY1-3

Electives

Electives are designed by the student to fulfill their interests and career goals. The intent is to give you the knowledge and skills to apply data science techniques in various fields. Although this individualized module can draw on graduate courses from across the university the courses given below are approved options:

Online Electives
AMFG523DESIGN AND ANALYSIS OF EXPERIMENTS3.0
CBEN524COMPUTER-AIDED PROCESS SIMULATION3.0
CCUS520CLIMATE CHANGE AND SUSTAINABILITY3.0
CSCI585INFORMATION SECURITY PRIVACY3.0
CSCI587CYBER PHYSICAL SYSTEMS SECURITY3.0
EBGN525BUSINESS ANALYTICS3.0
EBGN560DECISION ANALYTICS3.0
EBGN562STRATEGIC DECISION MAKING3.0
EBGN571MARKETING ANALYTICS3.0
EENG509SPARSE SIGNAL PROCESSING3.0
EENG510ADVANCED DIGITAL SIGNAL PROCESSING3.0
EENG515MATHEMATICAL METHODS FOR SIGNALS AND SYSTEMS3.0
EENG519ESTIMATION THEORY AND KALMAN FILTERING3.0
EENG577ADVANCED ELECTRICAL MACHINE DYNAMICS FOR SMART-GRID SYSTEMS3.0
EENG585AI FOR POWER AND RENEWABLE ENERGY SYSTEMS3.0
GEGN575APPLICATIONS OF GEOGRAPHIC INFORMATION SYSTEMS3.0
GEGN580APPLIED REMOTE SENSING FOR GEOENGINEERING AND GEOSCIENCES3.0
GEGN588ADVANCED PLANETARY GEOGRAPHIC INFORMATION SYSTEMS3.0
GEGN590GIS-BASED REAL WORLD LEARNING PROJECT I - FUNDAMENTALS3.0
GEGN592GIS-BASED REAL WORLD LEARNING PROJECT II - ADVANCED APPLICATIONS3.0
GEOL557EARTH RESOURCE DATA SCIENCE 1: FUNDAMENTALS3.0
GEOL558EARTH RESOURCE DATA SCIENCE 2: APPLICATIONS AND MACHINE-LEARNING3.0
GPGN545INTRODUCTION TO DISTRIBUTED FIBER-OPTIC SENSING AND ITS APPLICATIONS3.0
GPGN558SEISMIC DATA INTERPRETATION AND QUANTITATIVE ANALYSIS3.0
MNGN502GEOSPATIAL BIG DATA ANALYTICS3.0

 

Certificate Programs in Data Science

Program Requirements

There are five Certificates in Data Science.  Applicants for each are required to have an undergraduate degree to be admitted into the Certificate programs. Course prerequisites, if any, are noted for each Certificate program.

Students working toward one of the Data Science Certificates are required to successfully complete 12 credits, as detailed below for each Certificate. The courses taken for the Certificates can be used towards a Master’s or PhD degree at Mines, however courses used for one Data Science Certificate cannot also be counted toward another Data Science Certificate. 

Graduate Certificate in Data Science - Statistical Learning (12 credits)

The Data Science - Statistical Learning Graduate Certificate is an online or residential program focusing on statistical methods for interpreting complex data sets and quantifying the uncertainty in a data analysis.  The Certificate also includes gaining new skills in computer science but is grounded in statistical models for data, also termed statistical learning, rather than algorithmic approaches.  Students will develop an essential skill set in statistical methods most commonly used in data science along with the understanding of the methods' strengths and weaknesses.  Moreover, the coursework will cover a broad range of applications making it relevant for varied scientific and engineering domains. 

Applicants must have completed the following courses, or their equivalents, with a B- or better:  CSCI261 and CSCI262 Data Structures, MATH332 Linear Algebra and MATH334 Introduction to Probability.

DSCI503ADVANCED DATA SCIENCE3.0
DSCI530STATISTICAL METHODS I3.0
DSCI560INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I3.0
DSCI561INTRODUCTION TO KEY STATISTICAL LEARNING METHODS II3.0

Graduate Certificate in Data Science - Earth Resources (12 credits) 

The Graduate Certificate in Data Science - Earth Resources is an online program building on the foundational concepts in data science as it pertains to managing surface and subsurface Earth resources and on specific applications (use cases) from the petroleum and minerals industries as well as water resource monitoring and remote sensing of Earth change. The  Certificate includes one core introductory Data Science course, two courses specific to Earth resources and one elective.

DSCI503ADVANCED DATA SCIENCE3.0
GEOL557EARTH RESOURCE DATA SCIENCE 1: FUNDAMENTALS3.0
GEOL558EARTH RESOURCE DATA SCIENCE 2: APPLICATIONS AND MACHINE-LEARNING3.0
ELECTIVE (1) ELECTIVE FROM LIST BELOW3.0


Graduate​ Certificate in Data Science - Earth Resources Electives (select ONE (1) from the list below):

Geospatial Focus:
GEGN575APPLICATIONS OF GEOGRAPHIC INFORMATION SYSTEMS3.0
GEGN579PYTHON SCRIPTING FOR GEOGRAPHIC INFORMATION SYSTEMS3.0
Petroleum Focus:
GPGN519ADVANCED FORMATION EVALUATION3.0
GPGN547PHYSICS, MECHANICS, AND PETROPHYSICS OF ROCKS3.0
GPGN558SEISMIC DATA INTERPRETATION AND QUANTITATIVE ANALYSIS3.0
GPGN651ADVANCED SEISMOLOGY3.0
PEGN522ADVANCED WELL STIMULATION3.0
PEGN551PETROLEUM DATA ANALYTICS - FUNDAMENTALS3.0
Mining Focus:
MNGN548INFORMATION TECHNOLOGIES FOR MINING SYSTEMS3.0
Hydrology Focus:
CEEN581WATERSHED SYSTEMS MODELING3.0
Additional Options:
DSCI/MATH530STATISTICAL METHODS I3.0
EBGN525BUSINESS ANALYTICS3.0

Graduate Certificate in Petroleum Data Analytics (12 credits) 

The Graduate Certificate in Petroleum Data Analytics is an online program building on the foundational concepts in statistics and focusing on the data foundation of the oil and gas industry, the challenges of Big Data to oilfield operations and on specific applications (use cases) for petroleum analytics. The Certificate includes two core introductory Data Science courses and two course specific to petroleum engineering. 

DSCI503ADVANCED DATA SCIENCE3.0
DSCI530STATISTICAL METHODS I3.0
PEGN551PETROLEUM DATA ANALYTICS - FUNDAMENTALS3.0
PEGN552PETROLEUM DATA ANALYTICS - APPLICATIONS3.0

 

Graduate Certificate in Business Analytics

The certificate is an online or residential program. The requirements are to complete the core course and two elective courses:
Core Course
EBGN525BUSINESS ANALYTICS3.0
Elective Courses
EBGN559SUPPLY CHAIN ANALYTICS3.0
EBGN560DECISION ANALYTICS3.0
EBGN571MARKETING ANALYTICS3.0

Course substitutions may be approved on a case-by-case basis by the certificate director.  Completing the certificate will also position students to apply to either the master of science in engineering and technology management degree or the master of science in data science degree, as the certificate courses can be applied to either degree.

 

Graduate Certificate in Data Science - Foundations (12 credits) 

The Data Science - Foundations Graduate Certificate is an online or residential program focusing on the foundational concepts in statistics and computer science that support the explosion of new methods for interpreting data in its many forms. The Certificate balances an introduction to data science with teaching basic skills in applying methods in statistics and machine learning to analyze data.  Students will gain a perspective on the kinds of problems that can be solved by data intensive methods and will also acquire new analysis skills outside of the certificate. Moreover, the coursework will cover a broad range of applications, making it relevant for varied scientific and engineering domains. 

Applicants must have completed the following courses, or their equivalents, with a B- or better:  CSCI261 and CSCI262 Data Structures, MATH332 Linear Algebra and MATH334 Introduction to Probability.

DSCI503ADVANCED DATA SCIENCE3.0
DSCI570INTRODUCTION TO MACHINE LEARNING3.0
MATH530INTRODUCTION TO STATISTICAL METHODS3.0
MATH560INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I

Graduate Certificate in Data Science - Computer Science (12 credits) 

The Data Science - Computer Science Graduate Certificate is an online or residential program focusing on data science concepts within computer science (e.g., computational techniques and machine learning) plus prerequisite knowledge (e.g., probability and regression). The aim of this certificate is to help students develop an essential skill set in data analytics, including (1) deriving predictive insights by applying advanced statistics, modeling, and programming skills, (2) acquiring in-depth knowledge of machine learning and computational techniques, and (3) unearthing important questions and intelligence for a range of industries, from product design to finance.

Applicants must have completed the following courses, or their equivalents, with a B- or better:  CSCI261 and CSCI262 Data Structures, MATH213 Calculus III and MATH332 Linear Algebra.  DSCI530 Statistical Methods I, will serve as the MATH201 Probability and Statistics prerequisite for the two machine learning courses of the certificate (DSCI570 Introduction to Machine Learning and DSCI575 Machine Learning).

DSCI503ADVANCED DATA SCIENCE3.0
MATH530INTRODUCTION TO STATISTICAL METHODS3.0
DSCI570INTRODUCTION TO MACHINE LEARNING3.0
DSCI575ADVANCED MACHINE LEARNING3.0

Courses

DSCI503. ADVANCED DATA SCIENCE. 3.0 Semester Hrs.

(I, II) This course will teach students the core skills needed for gathering, cleaning, organizing, analyzing, interpreting, and visualizing data. Students will use the python programming language and related toolkits for data manipulation and the use and application of statistical and machine learning for data analysis. The course will be primarily focused on applications, with an emphasis on working with real (non-synthetic) datasets. Students will propose and design a semester project using a dataset from their domain of interest, leveraging the concepts and skills acquired from this course (e.g., data analysis, ethical considerations, evaluation and synthesis of results, storytelling and visualization). Prerequisite: CSCI200 with a grade of C- or higher or CSCI262 with a grade of C- or higher, MATH201 or MATH334 OR Graduate level standing and at least CSCI128 or equivalent.

View Course Learning Outcomes

View Course Learning Outcomes
  • Conduct data acquisition using a varied set of techniques structured and unstructured datasets; including raw data files, SQL databases, online repositories, and programmatically through web scraping and APIs.
  • Apply preprocessing strategies to complex and dynamic datasets using industry-standard toolkits and machine learning algorithms to extract features, reduce dimensionality, remove errors, inconsistencies, and missing values.
  • Differentiate between machine learning approaches such as classification, regression, clustering, and neural networks for predictive analytics and pattern recognition.
  • Evaluate the predictive power of the different statistical and machine learning methods to solve real-world data science problems.
  • Develop storytelling and visualization techniques to effectively communicate (exploratory) or persuade (explanatory) findings to a specific audience.
  • Critically asses ethical considerations and challenges related to data collection and analysis.
  • Construct a comprehensive data science project from inception to presentation, integrating the various techniques and tools learned throughout the course.

DSCI530. STATISTICAL METHODS I. 3.0 Semester Hrs.

Introduction to probability, random variables, and discrete and continuous probability models. Elementary simulation, data summarization and analysis using the R Data Analysis Environment. Confidence intervals and hypothesis testing for means and variances. Chi square tests. Distribution-free techniques and regression analysis. Students are expected to have knowledge of probability covered in MATH334 or an equivalent course. Prerequisite: MATH334 or equivalent.

View Course Learning Outcomes

DSCI560. INTRODUCTION TO KEY STATISTICAL LEARNING METHODS I. 3.0 Semester Hrs.

Part one of a two-course series introducing statistical learning methods with a focus on conceptual understanding and practical applications. Methods covered will include Introduction to Statistical Learning, Linear Regression, Classification, Resampling Methods, Basis Expansions, Regularization, Model Assessment and Selection. Prerequisite: DSCI530 or MATH530.

View Course Learning Outcomes

DSCI561. INTRODUCTION TO KEY STATISTICAL LEARNING METHODS II. 3.0 Semester Hrs.

Equivalent with MATH561,
Part two of a two course series introducing statistical learning methods with a focus on conceptual understanding and practical applications. Methods covered will include Non-linear Models, Tree-based Methods, Support Vector Machines, Neural Networks, Unsupervised Learning. Prerequisite: DSCI560 or MATH560.

View Course Learning Outcomes

DSCI570. INTRODUCTION TO MACHINE LEARNING. 3.0 Semester Hrs.

(I, II) The goal of machine learning is to build computer systems that improve automatically with experience, which has been successfully applied to a variety of application areas, including, for example, gene discovery, financial forecasting, and credit card fraud detection. This introductory course will study both the theoretical properties of machine learning algorithms and their practical applications. Students will have an opportunity to experiment with machine learning techniques and apply them to a selected problem in the context of term projects. Graduate students must complete a more challenging project that utilizes complex machine learning algorithms, requiring a deeper understanding of machine learning approaches and critical thinking. Prerequisite: DSCI503.

View Course Learning Outcomes

View Course Learning Outcomes
  • Apply supervised, unsupervised, reinforcement machine learning models and deep learning models to solve problems in areas such as prediction, recognition and classification.
  • Explore and develop with various tools, techniques and libraries in Python for data processing, feature extraction, visualization, validation and evaluation.
  • Create data visualization tools, techniques, and libraries in Python to visualize high dimensional or complex data for stakeholders.
  • Determine ethical implications through interpretability of big data and results from the application of various machine learning models.
  • Design and develop a machine learning product that solves their chosen real-world challenge.
  • Create a video presentation that succinctly outlines the problem, solutions, conclusions, and lessons learned regarding product development for the stakeholders.

DSCI575. ADVANCED MACHINE LEARNING. 3.0 Semester Hrs.

The goal of machine learning research is to build computer systems that learn from experience and that adapt to their environments. Machine learning systems do not have to be programmed by humans to solve a problem; instead, they essentially program themselves based on examples of how they should behave, or based on trial and error experience trying to solve the problem. This course will focus on the methods that have proven valuable and successful in practical applications. The course will also contrast the various methods, with the aim of explaining the situations in which each is most appropriate. Prerequisite: DSCI570.

View Course Learning Outcomes

DSCI576. DEEP LEARNING. 3.0 Semester Hrs.

Machine Learning is a key component of Artificial Intelligence that allows computers to generate accurate outcomes without being explicitly programmed. Over the past decade, machine learning has powered speech recognition, augmented reality, self-driving vehicles, and biometric authentications. The purpose of this course is to bridge general machine learning concepts to real world applications. This course provides a broad introduction to machine learning (supervised learning, unsupervised learning, feature learning), deep learning, computer vision, natural language processing, and reinforcement learning. During the course, students will design a project that applies concepts from class to several different real-world problems and challenges and create applications to solve them. Prerequisite: CSCI470.

View Course Learning Outcomes

View Course Learning Outcomes
  • Design deep learning models to address supervised and unsupervised learning tasks implemented in Python that meet evaluation criteria such as accuracy, F1 score, and mean-square error.
  • Apply state-of-the-art deep learning architectures for computer vision tasks involving object detection, image generation, and self-driving vehicles.
  • Develop deep learning models for natural language processing tasks using pre-trained transformers.
  • Utilize Markov Decision Process, Monte Carlo Method, and deep reinforcement learning techniques to solve sequential decision-making problems.
  • Evaluate ethical challenges in deep learning models, including bias, fairness, transparency, and interpretability.
  • Demonstrate clear communication of deep learning results and insights to both technical and non-technical audiences.

DSCI598. SPECIAL TOPICS. 0-6 Semester Hr.

View Course Learning Outcomes

DSCI598. SPECIAL TOPICS. 1-6 Semester Hr.

View Course Learning Outcomes

DSCI598. SPECIAL TOPICS. 1-6 Semester Hr.

View Course Learning Outcomes

DSCI598. SPECIAL TOPICSE. 1-6 Semester Hr.

View Course Learning Outcomes

DSCI598. DATA SCIENCE. 1-6 Semester Hr.

View Course Learning Outcomes

DSCI599. INDEPENDENT STUDY. 0.5-6 Semester Hr.

View Course Learning Outcomes

DSCI599. INDEPENDENT STUDY. 0.5-6 Semester Hr.

View Course Learning Outcomes