Course Content
KM-01: Introduction to RPA and Digital Transformation
This module introduces learners to the fundamentals of Robotic Process Automation (RPA), digital transformation, and automation technologies used in modern business environments. Learners will explore how businesses use automation to improve efficiency, reduce repetitive tasks, and support digital innovation.
0/7
KM-04: Computing Theory
This module introduces learners to the foundational principles of programming and computing theory used in software development and automation environments. Learners will explore programming languages, programming logic, algorithms, variables, operators, loops, functions, and software applications commonly used in modern computing systems. The module also introduces concepts related to web technologies, databases, artificial intelligence, and software development methodologies.
0/7
KM-05: Data, Databases and Data Scraping
This module introduces learners to the principles of data management, databases, and data scraping used in modern digital and automation environments. Learners will explore how organisations collect, store, analyse, secure, and visualise data to support business processes and decision-making. The module also introduces structured query language (SQL), relational databases, web scraping techniques, and software tools used for analysing and visualising data in automation and RPA environments.
0/13
KM-06: Introduction to RPA for Automation of Processes
This module introduces learners to the foundational concepts, technologies, and processes involved in Robotic Process Automation (RPA). Learners will explore automation principles, business process analysis, workflow automation, process mapping, bots, attended and unattended automation, and the role of RPA in improving operational efficiency. The module also examines how organisations identify processes suitable for automation and how RPA supports digital transformation initiatives.
0/7
KM-07: Robotic Process Automation (RPA)
This module focuses on building an understanding of how to use a toolkit or platform, using a vendor-specific approach, for the creation and deployment of automated processes. Learners will explore variables, arguments, automation selectors, control flow, data manipulation, automation concepts, automation management, and methods used to secure the RPA ecosystem from security risks. The module develops practical knowledge required to build, manage, and support automation solutions within modern RPA environments.
0/15
KM-08: Introduction to RPA Governance, Legislation and Ethics
This module introduces learners to governance, legislation, compliance, ethics, and responsible practices within Robotic Process Automation (RPA) environments. Learners will explore legal requirements, organisational governance, ethical considerations, compliance frameworks, privacy protection, intellectual property, accountability, and professional conduct related to automation technologies. The module also examines how organisations manage risk, maintain compliance, and ensure ethical use of RPA systems within modern digital business environments.
0/19
KM-09: Fundamentals of Design Thinking and Innovation
This module introduces learners to the fundamentals of design thinking and innovation within modern business and technology environments. Learners will explore design thinking principles, human-centered design, creativity, innovation, design concepts, design thinking methodologies, and the practical application of design thinking in software development, cybersecurity, and business problem-solving. The module focuses on developing innovative thinking, problem-solving skills, and creative approaches used in modern workplaces and digital transformation environments.
0/15
KM-10: 4IR and Future Skills
This module focuses on building an understanding of the impact of the Fourth Industrial Revolution (4IR) on communities, individuals, and businesses, as well as the future skills required in modern digital environments. Learners will explore emerging 4IR technologies, computing knowledge, future skills and competencies, business trends, interpersonal and intrapersonal skills, communication methods, workplace teamwork, customer service, and professional workplace practices required within modern organisations and Robotic Process Automation (RPA) environments.
0/29
PM-01: Basic Calculations for Programming
This practical module introduces learners to the mathematical and computational concepts required in programming and automation environments. Learners will develop practical skills in number systems, measurement conversions, mathematical operations, scientific notation, logical calculations, and computational problem solving. The module focuses on applying calculations and numerical reasoning in software development and Robotic Process Automation (RPA) environments. Learners will complete practical activities that strengthen analytical thinking, accuracy, and computational problem-solving skills required in modern digital workplaces.
0/16
PM-02: Basic Programming
This practical module introduces learners to fundamental programming concepts, software toolkits, coding environments, programming paradigms, data types, APIs, functions, logical operations, loops, SQL queries, error handling, and software development processes used in Robotic Process Automation (RPA) environments. Learners will develop practical programming skills by creating coding environments, writing and testing code, working with variables and functions, integrating APIs, handling errors, and developing simple automation solutions using industry-relevant software toolkits and platforms.
0/15
PM-03: Access, Analyse and Visualise Structured Data Using Spreadsheets and Scraping Tools
This practical module focuses on developing the skills required to access, analyse, organise, transform, visualise, and report structured data using spreadsheets, dashboards, pivot tables, databases, and web scraping tools within a Robotic Process Automation (RPA) environment. Learners will work with spreadsheet reporting, dashboards, pivot tables, SQL imports, data models, charts, and web scraping techniques to process and visualise data for business decision-making.
0/12
PM-05: Execute Test Procedures for Evaluating the RPA Solution Performance
This practical module focuses on developing the practical skills required to prepare, execute, evaluate, and improve test procedures for Robotic Process Automation (RPA) solutions. Learners will work with test cases, testing methodologies, simulation tools, workflow evaluations, exception handling, and remedial actions to determine whether an RPA solution passes or fails according to business and technical requirements. Learners will also develop the ability to analyse automation outcomes, identify application and workflow issues, document test evidence, and apply corrective actions to improve automation reliability and performance.
0/4
PM-06: Deploy RPA Solutions Which Emulate Actions of a Human Interacting Within Digital Systems
This practical module focuses on developing the practical skills required to deploy, schedule, monitor, manage, and maintain Robotic Process Automation (RPA) solutions within production environments. Learners will work with unattended and attended robots, deployment procedures, process documentation, auditing dashboards, scheduling systems, and RPA environment management tools. Learners will also develop the ability to schedule automated workflows, deploy bots into production environments, update process documentation, train end-users, monitor runtime activities, and import or export automation solutions between environments.
0/7
PM-07: Modify and Improve Existing RPA Solutions
This practical module focuses on developing the practical skills required to troubleshoot, improve, maintain, and optimise existing Robotic Process Automation (RPA) solutions within operational environments. Learners will work with debugging tools, workflow optimisation techniques, infrastructure changes, software upgrades, regulatory requirements, and process improvement strategies to ensure that automation workflows continue to operate efficiently and reliably. Learners will also develop the ability to investigate alternative solutions, apply continuous improvement techniques, manage changes in technical environments, explore workflow scalability, and update robotic workflows when organisations upgrade RPA software versions.
0/6
PM-08: Function Ethically and Effectively as a Member of a Multidisciplinary Team
This practical module focuses on developing the practical skills required to function ethically, professionally, and collaboratively within multidisciplinary Robotic Process Automation (RPA) environments. Learners will work with business analysts, solution architects, DevOps teams, infrastructure engineers, project managers, business users, and stakeholders throughout the automation life cycle. Learners will also develop the ability to communicate effectively, collaborate across departments, support business process automation initiatives, engage with stakeholders ethically, adapt to organisational policies and infrastructure changes, and contribute to teamwork and business optimisation activities.
0/7
PM-09: Apply Design Thinking Methodologies
This practical module focuses on developing the practical skills required to apply Design Thinking methodologies within problem-solving and innovation environments. Learners will collaborate with multidisciplinary teams to investigate problems, generate innovative ideas, develop prototypes, and test solutions using the Design Thinking process. Learners will also develop the ability to engage in collaborative discussions, participate in innovation workshops, analyse user needs, challenge assumptions, generate creative solutions, and apply the five Design Thinking phases: Empathize, Define, Ideate, Prototype, and Test.
0/3
Occupational Certificate: Robotic Process Automation (RPA) Developer

Lesson Overview

This lesson introduces learners to the concepts, principles, and processes involved in data scraping. Learners will explore how organisations collect information from websites and digital platforms using scraping tools and automation technologies. The lesson also examines web scraping procedures, legal considerations, and common libraries used in data scraping environments.

Lesson Outcomes

After completing this lesson, learners will be able to:

  • Define data scraping and explain its purpose

  • Identify common data scraping tools

  • Explain legal and ethical considerations related to data scraping

  • Describe the web scraping process

  • Identify libraries commonly used for web scraping

  • Explain how data scraping supports automation and business processes


KT0401: Concept and Definition

Data scraping refers to the automated process of extracting information from websites, databases, or digital platforms.

Web scraping is commonly used to collect large amounts of information quickly and efficiently.

Instead of manually copying information from websites, automated tools and scripts gather data automatically.

Data scraping may involve extracting:

  • Product information

  • Prices

  • Customer reviews

  • Market trends

  • Financial data

  • Contact information

  • News articles

Data scraping is important because organisations rely on data to support:

  • Business analysis

  • Automation

  • Reporting

  • Research

  • Decision-making

In RPA environments, bots often perform scraping activities automatically as part of larger automation workflows.


KT0402: Purpose of Data Scraping

The purpose of data scraping is to collect useful information efficiently from online or digital sources.

Organisations use data scraping to:

  • Gather market information

  • Monitor competitors

  • Analyse customer behaviour

  • Generate reports

  • Collect research data

  • Support automation processes

Examples of data scraping applications include:

Industry Example
Retail Monitoring product prices
Finance Collecting stock market data
Marketing Gathering customer trends
Recruitment Collecting job listings
Research Gathering online information

Data scraping improves productivity because large amounts of information can be collected automatically and processed quickly.


KT0403: Data Scraping Tools

Data scraping tools are software applications or frameworks used to collect and process information from websites and systems.

Common data scraping tools include:

  • Beautiful Soup

  • Scrapy

  • Selenium

  • UiPath scraping tools

  • Octoparse

  • ParseHub

Features of Data Scraping Tools

These tools may provide:

  • Automated data extraction

  • Web navigation

  • Data storage

  • Browser automation

  • Data filtering

  • Integration with databases

In RPA environments, scraping tools are often integrated into automation workflows to process information automatically.


KT0404: Legal Issues

Although data scraping is widely used, organisations must ensure that data collection activities comply with legal and ethical requirements.

Common legal considerations include:

  • Privacy regulations

  • Copyright laws

  • Website terms and conditions

  • Data protection laws

  • Intellectual property rights

Improper scraping activities may result in:

  • Legal penalties

  • Security violations

  • Privacy breaches

  • System blocking

Organisations must ensure that:

  • Sensitive information is protected

  • Data is collected ethically

  • Scraping activities comply with regulations

  • User privacy is respected

Responsible data scraping practices are important in automation and digital business environments.


KT0405: Web Scraping Procedure

Web scraping follows a structured process to collect and organise information.

Step 1: Find the URL to Scrape

The first step is identifying the webpage or online source containing the required information.

Step 2: Inspect the Page

Developers inspect the webpage structure to identify where the required data is located.

This may include:

  • HTML elements

  • Tags

  • Classes

  • IDs

Step 3: Find the Data to Extract

Specific information is identified for extraction.

Examples include:

  • Product names

  • Prices

  • Contact details

  • Tables

  • Images

Step 4: Write the Code

Developers create scripts or automation workflows to extract the required information.

Example scraping technologies may include:

  • Python scripts

  • RPA bots

  • Scraping frameworks

Step 5: Run the Code and Extract the Data

The script or automation tool retrieves the information automatically.

Step 6: Store the Data in the Required Format

Extracted information is stored in formats such as:

  • CSV files

  • Databases

  • Excel spreadsheets

  • JSON files

Proper storage allows organisations to analyse and use the information effectively.


KT0406: Libraries Used for Web Scraping

Libraries are collections of prewritten code used to simplify development tasks.

Web scraping libraries help developers extract and process information efficiently.

Common web scraping libraries include:

Library Purpose
Beautiful Soup Parses HTML and XML
Scrapy Web scraping framework
Selenium Browser automation
Requests Sends HTTP requests

Beautiful Soup

Beautiful Soup helps developers navigate and extract information from HTML pages.

Scrapy

Scrapy is a powerful framework designed for large-scale scraping projects.

Selenium

Selenium automates browser interactions and is useful for dynamic websites.

Requests Library

The Requests library sends HTTP requests to websites to retrieve webpage information.

Libraries improve efficiency because developers do not need to write all scraping functionality from scratch.


Data Scraping in Automation and RPA

In automation environments, RPA bots may use scraping technologies to:

  • Extract website information

  • Process customer data

  • Collect reports

  • Monitor systems

  • Automate repetitive online tasks

Data scraping supports intelligent automation because bots can collect information automatically and feed it into workflows and reporting systems.


Key Notes

  • Data scraping is the automated extraction of information from digital sources.

  • Organisations use scraping to collect data for analysis, reporting, and automation.

  • Common scraping tools include Beautiful Soup, Scrapy, Selenium, and UiPath tools.

  • Legal and ethical compliance is important during data scraping activities.

  • Web scraping follows a structured process from identifying URLs to storing extracted data.

  • Libraries simplify scraping and browser automation processes.

  • RPA bots often integrate scraping technologies into automation workflows.

Scroll to Top