EICTA, IIT Kanpur

Data Analyst Course Syllabus 2026: Learn SQL, Python, Excel & More

E&ICTA29 December 2025

A data analyst course in 2026 covers the tools, techniques, and frameworks needed to collect, clean, analyse, and present data in ways that drive real business decisions. The core modules in a complete data analyst syllabus are:

Quick module overview:

Module Tools Covered What You Can Do After
Excel for Data Analysis Excel, Power Query, Pivot Tables Clean and analyse business data, build dashboards
SQL and Database Management MySQL, PostgreSQL, BigQuery Extract and query data from any relational database
Python for Data Analytics Pandas, NumPy, Matplotlib, Seaborn Automate analysis, handle large datasets, visualise trends
Statistics and Probability Statistical methods, hypothesis testing Validate findings, design experiments, interpret results
Data Visualisation Power BI, Tableau Build interactive dashboards and executive reports
Machine Learning Basics Scikit-learn, regression, classification Build predictive models, improve forecast accuracy
Capstone and Portfolio Real datasets, end-to-end projects Present to employers with documented, real project work

The global data analytics market is expected to reach $132.9 billion by 2026. Businesses in every sector, from banking and healthcare to e-commerce and manufacturing, rely on data analysts to find patterns in complex datasets, identify the factors driving performance, and present insights that inform decisions. The demand for qualified data analysts in India is growing faster than the supply of trained professionals, which means this is one of the most favourable job markets in the country for people entering the field.

Best Data Analyst Course Online: Enroll Now!

What Is a Data Analyst and What Do They Do?

A data analyst is a professional who collects data from business systems, cleans and organises it, analyses it to identify meaningful patterns, and presents findings in a clear format that non-technical stakeholders can understand and act on.

In practice, a data analyst at an e-commerce company might investigate why customer return rates have increased in a specific product category, building a query to extract the relevant data, cleaning it to remove duplicates and errors, running an analysis to identify whether returns are concentrated in a specific time window or customer segment, and presenting the findings with a Power BI dashboard that the category management team can use to make decisions.

At a hospital, a data analyst might analyse patient flow data to identify which times of day the emergency department is most overloaded, build a predictive model that forecasts daily admission volumes, and present a staffing recommendation to the operations team.

The common thread is a structured process: data in, insight out, decision enabled. Every module in the EICTA Consortium data analyst course builds one part of this capability.

Must Read: Data Analytics Technique

Data analyst versus data scientist: what is the difference?

This is the most common question prospective students ask before enrolling. The distinction matters for choosing the right course.

A data analyst focuses on understanding what has happened and why, using existing data to answer specific business questions. The core tools are SQL, Excel, Python with Pandas, and visualisation platforms like Power BI and Tableau.

A data scientist builds predictive and machine learning models to answer what will happen next and what should we do, working with larger and more complex datasets and using more advanced statistical and programming tools.

Most career paths in data start with data analysis. Building strong fundamentals in SQL, Python, and statistics creates the foundation that data science roles build upon. A data analyst course is the correct starting point for most people entering the field regardless of their eventual specialisation.

Module 1: Introduction to Data Analytics

The foundation module establishes the conceptual framework for everything that follows. Students learn what data analytics actually involves in practice, how it differs from data science and business intelligence, and where a data analyst fits within an organisation.

What this module covers:

The four types of data analytics: descriptive (what happened), diagnostic (why it happened), predictive (what will happen), and prescriptive (what should we do). Understanding these four levels helps students see how the technical skills they learn in later modules connect to real business applications.

Data sources and data types: where business data comes from (databases, APIs, spreadsheets, CRM systems, sensor feeds), how structured data (stored in tables with defined rows and columns) differs from unstructured data (text, images, logs), and why the distinction matters for choosing the right analysis approach.

The data analyst workflow: from problem definition through data collection, cleaning, analysis, and communication. Understanding the end-to-end workflow before learning individual tools prevents the common mistake of learning tools in isolation without understanding how they connect.

Career pathways: the roles available to qualified data analysts in India in 2026, the industries hiring most actively, typical salary expectations, and the skill progression from junior analyst to senior analyst and beyond.

Also Read: Data Analyst Skills

Module 2: Excel for Data Analysis

Excel remains the most widely used tool for data analysis in Indian businesses. Excel is the most widely used tool for day-to-day data work. Even with new tools coming up, Excel remains the place where most teams store, clean, and explore their data. A data analyst who cannot use Excel effectively will struggle in most corporate environments regardless of their Python or SQL skills.

What this module covers:

Core Excel for data analysis: Sorting, filtering, conditional formatting, data validation, VLOOKUP, HLOOKUP, INDEX-MATCH, SUMIF, COUNTIF, and pivot tables. These are the functions that appear in every data analyst's daily work. A pivot table that summarises sales by region and product category in thirty seconds would take hours to produce manually.

Advanced Excel capabilities: Power Query for connecting to external data sources and automating data transformation, Power Pivot for creating data models that handle datasets too large for standard Excel, and dynamic dashboards that update automatically when source data changes.

Data cleaning in Excel: Real business data is messy. It contains duplicates, inconsistent formats, missing values, and errors. This section teaches the specific Excel techniques used to identify and correct these issues before analysis begins. Poor data quality is the most common reason analysis produces misleading results.

Data storytelling with Excel charts: Choosing the right chart type for the question being answered, formatting charts to communicate clearly rather than impressively, and building a simple executive-ready dashboard from raw data.

Also: Excel for Data Analytics

Module 3: SQL and Database Management

SQL is the most essential skill for modern data analysts because real business data lives in databases. When a company stores customer orders, product inventory, financial transactions, or website behaviour data, it almost certainly stores it in a relational database that is queried using SQL.

What this module covers:

SQL fundamentals: Understanding what relational databases are and how they are structured. Writing SELECT statements to retrieve data, using WHERE clauses to filter results, and sorting and limiting output. A data analyst who can write a clean SELECT statement with filtering and sorting can answer most basic business questions independently.

Data manipulation: Using GROUP BY with aggregate functions (SUM, COUNT, AVG, MIN, MAX) to summarise data, which is the foundation of most business reporting. Understanding the difference between GROUP BY and WHERE, and when to use HAVING for filtering aggregated results.

Joins: Combining data from multiple tables using INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL OUTER JOIN. This is where SQL becomes genuinely powerful. A customer order table joined to a product table joined to a customer profile table can answer questions that no individual table could answer alone.

Subqueries and Common Table Expressions (CTEs): Writing more complex queries that layer analysis in readable, manageable steps. Essential SQL skills include handling NULL values, using CASE WHEN for conditional logic, understanding basic indexing for performance, and working with different data types.

Working with real databases: Practising queries against realistic business datasets in MySQL or PostgreSQL, understanding database design principles, and learning to read an entity-relationship diagram so you can understand an unfamiliar database structure quickly.

Also Read: Data Analytics Methods

Module 4: Python for Data Analytics

Python is the stage where learners move beyond spreadsheets and start thinking like analysts. Beginning with data manipulation, once you learn to reshape and clean datasets using Pandas or NumPy, the rest becomes quite easier. Python is the bridge between Excel-level analysis and the more sophisticated work that distinguishes strong data analysts from average ones. It handles datasets too large for Excel, automates repetitive tasks, connects to APIs and web data sources, and provides the foundation for machine learning.

What this module covers:

Python fundamentals: Variables, data types (integers, floats, strings, lists, dictionaries), control flow (if-else logic, for loops, while loops), and functions. These are not taught as abstract programming concepts but as practical tools for solving data problems.

Pandas for data manipulation: Pandas is the Python library that data analysts use daily. Loading data from CSV, Excel, and database connections, inspecting dataset structure and data types, handling missing values, filtering and sorting data, grouping and aggregating, merging datasets (the Python equivalent of SQL joins), and reshaping data with pivot operations. A data analyst fluent in Pandas can handle almost any data transformation task efficiently.

NumPy for numerical computation: Array operations, mathematical calculations, and the statistical functions that underpin more advanced analysis. NumPy is the computational engine that Pandas and Scikit-learn are built on.

Data visualisation with Matplotlib and Seaborn: Creating line charts, bar charts, scatter plots, histograms, and heatmaps programmatically. Visualising data distributions, correlations, and trends as part of exploratory data analysis before deeper analysis begins.

Automation with Python: Writing scripts that automate data collection, transformation, and reporting tasks that would otherwise be done manually each week. For many data analysts, automation alone makes their Python investment pay off rapidly.

Module 5: Statistics and Probability for Data Analysis

Statistics is what separates data analysts who describe what they see from those who can validate whether what they see is real and what it means for business decisions. A chart showing that sales increased after a campaign is a description. A hypothesis test confirming that the increase is statistically significant and not random variation is a validated finding.

What this module covers:

Descriptive statistics: Mean, median, mode, standard deviation, variance, and percentiles. Understanding distributions: normal distribution, skewness, and kurtosis. Box plots and histograms for visualising distributions. These concepts are used in every exploratory data analysis.

Inferential statistics: Moving from a sample to conclusions about a population. Sampling theory, confidence intervals, and the central limit theorem. Understanding when conclusions from data analysis are reliable and when they might be misleading due to sample size or selection bias.

Hypothesis testing: The framework for validating whether an observed difference or pattern is statistically significant. T-tests for comparing means, chi-square tests for categorical data, ANOVA for comparing multiple groups. Practically, this is used for A/B testing: did the new product page design actually improve conversion, or did it just happen to coincide with higher traffic?

Regression analysis: Linear regression for understanding relationships between variables and making predictions. Interpreting regression coefficients, R-squared values, and p-values in business terms. This is the most widely used analytical method in business data analysis.

Probability fundamentals: Basic probability, conditional probability, Bayes theorem, and probability distributions. These concepts underpin both statistical analysis and the machine learning models in the advanced module.

Module 6: Data Visualisation with Power BI and Tableau

Numbers become decisions only when they are communicated clearly to the people who need to act on them. Dashboards, metrics translated into business insights, and leadership reporting support are core skills for modern data analysts.

What this module covers:

Power BI: Microsoft's business intelligence platform is one of the most widely deployed visualisation tools in Indian enterprises. The module covers connecting to multiple data sources, building data models with relationships between tables, creating calculated columns and measures using DAX (Data Analysis Expressions), designing interactive dashboards with filters and drilldowns, and publishing and sharing reports within an organisation.

Tableau: Used extensively in technology, consulting, and multinational companies. Covers the Tableau interface, connecting to data, building worksheets and dashboards, using calculated fields, and creating the kind of polished interactive visualisations that senior stakeholders use for decision-making.

Principles of effective data visualisation: Choosing the right chart type for the question being answered (a bar chart for comparison, a line chart for trend, a scatter plot for correlation), removing chart clutter that obscures the insight, using colour and annotation to direct attention, and telling a coherent analytical story rather than presenting a collection of charts.

Dashboard design for business audiences: Understanding what decision-makers need to see versus what analysts find interesting. Building executive dashboards that answer the three or four questions leadership asks most frequently, with the ability to drill down to underlying detail when needed.

Module 7: Introduction to Machine Learning for Data Analysts

Machine learning is increasingly expected knowledge for data analyst roles in 2026. Analysts do not need to build production ML models from scratch, but they need to understand how machine learning works, what problem types it is suited for, and how to evaluate whether a model is performing well.

What this module covers:

Supervised versus unsupervised learning: Supervised learning (where the model learns from labelled examples to predict a target variable) covers regression for predicting numerical outcomes and classification for predicting categorical outcomes. Unsupervised learning (where the model finds patterns without labelled targets) covers clustering and dimensionality reduction.

Building models with Scikit-learn: The Python machine learning library that data analysts use for most practical ML tasks. Training linear regression models, decision trees, and random forests. Understanding the train-test split and cross-validation as the correct approach to building models that generalise to new data.

Feature engineering: Selecting and transforming input variables to improve model performance. Understanding which variables matter, how to handle categorical variables, and how to deal with missing data in a machine learning context.

Model evaluation: Accuracy, precision, recall, F1-score, and the confusion matrix for classification models. Mean absolute error, root mean square error, and R-squared for regression models. Understanding which metric matters most for a given business problem.

Practical applications: Demand forecasting, customer churn prediction, product recommendation, and fraud detection are the most common applications of ML in Indian businesses. The module connects each technique to real use cases so students understand when to apply them.

Module 8: Capstone Project and Portfolio Building

A capstone is the chance to turn everything learned into something shareable with clients and recruiters. Building projects that solve an actual problem is valuable because recruiters look for proof that a candidate can work with data end-to-end.

The capstone module is where theoretical knowledge becomes demonstrable skill. Students complete an end-to-end project on a real business dataset that covers every stage of the data analyst workflow.

What the capstone involves:

Selecting a real business problem and dataset from the capstone options provided (marketing attribution analysis, supply chain optimisation, hospital resource planning, or customer churn analysis). Cleaning and preparing the data using Python and Excel. Running the analysis using SQL queries and Python. Visualising findings in Power BI or Tableau. Presenting conclusions with specific, quantified business recommendations backed by the data.

The completed capstone project, along with two to three smaller module projects, forms the foundation of a professional portfolio. A portfolio that demonstrates end-to-end analytical capability is consistently cited by hiring managers as the most influential factor in shortlisting junior data analyst candidates.

Career Paths After a Data Analyst Course

Completing a structured data analyst course opens several specific career paths. The role you pursue depends on the industry you target and the skills you develop most strongly during the programme.

Data Analyst: The direct entry point for most graduates. Works with business stakeholders to answer specific questions using data. Typical tools are SQL, Excel, Python, and Power BI or Tableau. Data analysts are in demand across finance, healthcare, retail, marketing, and technology.

Business Analyst: Combines data analysis with business process understanding to recommend operational improvements. More stakeholder-facing than a pure data analyst role, with strong communication and requirements-gathering skills valued alongside technical capability.

Business Intelligence (BI) Analyst: Specialises in building and maintaining the dashboards, reports, and data models that an organisation uses for performance monitoring. Heavy focus on Power BI, Tableau, SQL, and data modelling.

Marketing Analyst: Applies data analysis to marketing performance: campaign attribution, customer segmentation, A/B testing, and funnel analysis. Strong demand in e-commerce, D2C brands, and marketing agencies.

Financial Analyst: Uses data analysis for financial planning, budget variance analysis, cost optimisation, and risk assessment. SQL and Excel are core tools; Python is increasingly expected.

Data Scientist (with further development): Building on the machine learning module, experienced data analysts who develop stronger ML and statistical modelling skills can transition into data science roles. This typically takes one to two years of additional focused development after the analyst role.

Data Analyst Salary in India 2026

Understanding the earning potential at each career stage is important for making an informed decision about investing in this course.

Experience Level Typical Salary Range (India)
Fresher (0 to 1 year) Rs. 3.5 to Rs. 7 LPA
Junior Analyst (1 to 3 years) Rs. 6 to Rs. 12 LPA
Mid-Level Analyst (3 to 6 years) Rs. 10 to Rs. 18 LPA
Senior Analyst (6+ years) Rs. 16 to Rs. 28 LPA
Analytics Manager Rs. 22 to Rs. 40 LPA

Salary varies significantly based on industry, company size, city, and technical skill depth. Analysts with strong SQL and Python skills, proficiency in Power BI or Tableau, and the ability to communicate findings clearly consistently earn at the upper end of these ranges.

Also Read: Data Analyst Salary

Industries paying the highest salaries for data analysts in India in 2026 are financial services (banking, fintech, and investment management), e-commerce and quick commerce, healthcare and pharmaceutical companies with digital health ambitions, and technology companies and IT services firms serving global clients.

Who Should Enroll in the EICTA Consortium Data Analyst Course?

Fresh graduates and students looking to enter the job market with a skill set that is in demonstrably strong demand. The course is designed to be accessible to students from any background, whether engineering, commerce, or arts, as long as they have basic mathematical aptitude and genuine interest in working with data.

Working professionals from non-technical backgrounds who want to transition into data roles. Marketing professionals, finance teams, operations coordinators, and HR professionals who currently work with data but want to formalise and deepen their analytical skills find this course particularly valuable. The tools taught in this course are the ones their employers already use.

Early-career professionals in adjacent roles who want to move from their current position into a data analyst role. Business analysts who lack technical skills, junior finance professionals who want to add Python and SQL to their toolkit, and entry-level IT professionals who want to move toward analytics are all well-suited to this programme.

Managers and business leaders who need to understand data well enough to work effectively with their analytics teams, evaluate data-driven recommendations, and make better decisions from the dashboards and reports their teams produce.

Frequently Asked Questions

What is the syllabus of a data analyst course in 2026?

A complete data analyst course syllabus covers Excel for data manipulation and reporting, SQL for querying databases, Python for data analysis using Pandas and NumPy, statistics and probability for validated analysis, data visualisation using Power BI and Tableau, an introduction to machine learning, and a capstone project that demonstrates end-to-end capability. The EICTA Consortium programme covers all of these modules with real business datasets and industry case studies throughout.

Do I need a coding background to enroll in a data analyst course?

No prior coding experience is required. Python and SQL are taught from the fundamentals, starting with the basic syntax and building to practical data analysis applications. Most successful graduates of data analyst programmes enter without programming experience. What matters is a willingness to practise consistently, basic comfort with mathematics, and genuine interest in working with data.

How long does it take to complete a data analyst course?

Structured programmes typically run from three to six months depending on the depth of coverage and the pace of the programme. The EICTA Consortium course is designed to build job-ready skills within this timeframe through a combination of conceptual instruction, hands-on practice, and real project work. Self-directed learning without structure typically takes significantly longer and produces inconsistent results.

What jobs can I get after completing a data analyst course?

Graduates can apply for roles including data analyst, business analyst, business intelligence analyst, marketing analyst, financial analyst, and operations analyst. In India in 2026, these roles are in strong demand across financial services, e-commerce, healthcare, FMCG, and technology companies. Starting salaries for fresh graduates with a strong portfolio typically range from Rs. 3.5 to Rs. 7 LPA depending on the employer and city.

Is data analysis better than data science as a career path?

Data analysis is the right starting point for most people. It builds the foundational skills in SQL, Python, and statistics that both fields require, and it provides the business context and communication skills that many data scientists lack. Starting in data analysis, developing a strong track record, and then building additional machine learning and modelling skills to move into data science is the path that most successful data scientists in India have followed. Jumping directly into data science without analytical foundations typically produces professionals who can build models but cannot interpret their business implications.

What tools will I learn in a data analyst course?

The core tools are Microsoft Excel (including Power Query and Pivot Tables), SQL (in MySQL or PostgreSQL), Python (with Pandas, NumPy, Matplotlib, and Seaborn), Power BI or Tableau for visualisation, and Scikit-learn for machine learning basics. These are the specific tools that appear most frequently in Indian data analyst job postings and that hiring managers expect graduates to demonstrate proficiency in during technical interviews.

What is the difference between a data analyst and a data scientist?

A data analyst answers specific business questions using existing data, primarily using SQL, Excel, Python, and visualisation tools. A data scientist builds predictive and machine learning models to forecast outcomes and discover patterns in complex datasets, using more advanced statistical techniques and requiring stronger programming skills. Most data science careers begin with data analysis experience, making the analyst role both a valuable career in its own right and a stepping stone to more advanced technical roles.

Customer Support

Subscribe for expert insights and updates on the latest in emerging tech, directly from the thought leaders at EICTA consortium.