Objectives
Upon completion of this lesson, you will be able to:
- explain the need for cursors
- know when to use cursors
- use cursors in MySQL stored procedures
Overview
In relational databases, a cursor is a programming construct that allows a developer to traverse and manipulate the result set returned by a SQL SELECT query. It is typically used in stored procedures or other database programming scenarios where the developer needs to work with individual rows of a result set one at a time.
Cursors can be useful in situations where the data needs to be processed row-by-row and where it is not practical or efficient to load the entire result set into memory at once. For example, if a stored procedure needs to perform some calculations on each row of a large result set, using a cursor can help to conserve memory and reduce the amount of processing required at any given time.
However, it is important to note that cursors can also be a source of performance problems if not used carefully, since they can require additional overhead and may lead to locking and blocking issues in concurrent database access situations if not properly managed. Therefore, it is generally recommended to use cursors only when necessary and to consider alternative approaches, such as in-memory traversal, whenever possible.
Cursors in MySQL
To create and use a cursor in MySQL, follow these steps:
- Declare the cursor: To declare a cursor, you can use the DECLARE statement followed by a unique name for the cursor and the SELECT statement that defines the result set. For example:
DECLARE my_cursor CURSOR FOR SELECT * FROM my_table;
- Open the cursor: To open the cursor, you can use the OPEN statement followed by the cursor name. For example:
- Fetch the rows: To fetch the rows one by one, you can use the FETCH statement followed by the cursor name and the variable names that will hold the values of the columns in each row. For example:
FETCH my_cursor INTO var1, var2, var3;
You can use a loop to fetch all the rows one by one until there are no more rows to fetch.
- Close the cursor: When you are done fetching the rows, you should close the cursor using the CLOSE statement followed by the cursor name. For example:
The cursors provided by MySQL are embedded cursors and are subject to some restrictions and have certain characteristics. Specifically, they are:
READ ONLY, i.e., is not possible to update any table through a cursor
Non-Scrollable, i.e., with cursors you can only retrieve records from a table in one direction
Asensitive, i.e., cursors are insensitive to the changes that are made in the table and any modifications made in the table are not reflected in the cursor; that implies that a cursor moving over all the records in a table would not see any changes made to the table after the cursor is created
Sample Database
The SQL statement below creates the customers and export tables used in the examples.
Using Cursors from Stored Procedures
Cursors are most commonly employed within stored procedures. The example below defines a simple stored procedure that uses a cursor:
CREATE PROCEDURE ExportDataProc()
BEGIN
DECLARE done INT DEFAULT 0;
DECLARE custID INTEGER;
DECLARE custProf, custOrigin VARCHAR(64);
DECLARE aCursor CURSOR FOR SELECT cid, profession, country FROM customers;
DECLARE CONTINUE HANDLER FOR NOT FOUND SET done = 1;
OPEN aCursor;
label: LOOP
FETCH aCursor INTO custID, custProf, custOrigin;
INSERT INTO export VALUES(custID, custProf, custOrigin);
IF done = 1 THEN LEAVE label;
END IF;
END LOOP;
CLOSE aCursor;
END
To demonstrate the the cursor and stored procedure work, we will call the procedure from R:
status <- dbExecute(conn = con,
"CALL ExportDataProc();")
Summary
Cursors in relational databases allow a developer to linearly traverse the result set returned by a SQL SELECT query. They are useful when data needs to be processed row-by-row, and loading the entire result set into memory at once is not appropriate.
MySQL cursors are read-only, non-scrollable, and insensitive to changes made to the table while the cursor is active.
Developers should use cursors only when necessary and consider alternative approaches such as in-memory traversal.
Errata
None collected yet. Let us know.
---
title: "Efficient Data Retrieval with Cursors"
params:
  category: 70
  number: 605
  time: 30
  level: beginner
  tags: "MySQL,cursors,SQL"
  description: "This lesson explains how to leverage cursors for
                efficient retrieval of large result sets. Uses
                programming examples in MySQL."
date: "<small>`r Sys.Date()`</small>"
author: "<small>Martin Schedlbauer</small>"
email: "m.schedlbauer@neu.edu"
affilitation: "Northeastern University"
output: 
  bookdown::html_document2:
    toc: true
    toc_float: true
    collapsed: false
    number_sections: false
    code_download: true
    theme: spacelab
    highlight: tango
---

---
title: "<small>`r params$category`.`r params$number`</small><br/><span style='color: #2E4053; font-size: 0.9em'>`r rmarkdown::metadata$title`</span>"
---

```{r code=xfun::read_utf8(paste0(here::here(),'/R/_insert2DB.R')), include = FALSE}
```

------------------------------------------------------------------------

## Objectives

Upon completion of this lesson, you will be able to:

-   explain the need for cursors
-   know when to use cursors
-   use cursors in MySQL stored procedures

------------------------------------------------------------------------

## Overview

In relational databases, a *cursor* is a programming construct that allows a developer to traverse and manipulate the result set returned by a SQL SELECT query. It is typically used in stored procedures or other database programming scenarios where the developer needs to work with individual rows of a result set one at a time.

Cursors can be useful in situations where the data needs to be processed row-by-row and where it is not practical or efficient to load the entire result set into memory at once. For example, if a stored procedure needs to perform some calculations on each row of a large result set, using a cursor can help to conserve memory and reduce the amount of processing required at any given time.

However, it is important to note that cursors can also be a source of performance problems if not used carefully, since they can require additional overhead and may lead to locking and blocking issues in concurrent database access situations if not properly managed. Therefore, it is generally recommended to use cursors only when necessary and to consider alternative approaches, such as in-memory traversal, whenever possible.

## Cursors in MySQL

To create and use a cursor in MySQL, follow these steps:

1.  **Declare the cursor**: To declare a cursor, you can use the *DECLARE* statement followed by a unique name for the cursor and the *SELECT* statement that defines the result set. For example:

``` sql
DECLARE my_cursor CURSOR FOR SELECT * FROM my_table;
```

2.  **Open the cursor**: To open the cursor, you can use the *OPEN* statement followed by the cursor name. For example:

``` sql
OPEN my_cursor;
```

3.  **Fetch the rows**: To fetch the rows one by one, you can use the *FETCH* statement followed by the cursor name and the variable names that will hold the values of the columns in each row. For example:

``` sql
FETCH my_cursor INTO var1, var2, var3;
```

You can use a loop to fetch all the rows one by one until there are no more rows to fetch.

4.  **Close the cursor**: When you are done fetching the rows, you should close the cursor using the *CLOSE* statement followed by the cursor name. For example:

``` sql
CLOSE my_cursor;
```

The cursors provided by MySQL are embedded cursors and are subject to some restrictions and have certain characteristics. Specifically, they are:

**READ ONLY**, *i.e.*, is not possible to update any table through a cursor

**Non-Scrollable**, *i.e.*, with cursors you can only retrieve records from a table in one direction

**Asensitive**, *i.e.*, cursors are insensitive to the changes that are made in the table and any modifications made in the table are not reflected in the cursor; that implies that a cursor moving over all the records in a table would not see any changes made to the table after the cursor is created

### Sample Database

```{r connect2db4free, eval=T, echo=F, message=F, warning=F}
library(RMySQL)

# MySQL Database Connection Settings
db_user <- 'cs5200' 
db_password <- 'khour#2021'
db_name <- 'cs5200db1'
db_table <- 'courses'
db_host <- 'db4free.net' # Host 85.10.205.173
db_port <- 3306

# Connect to MySQL Database
con <-  dbConnect(MySQL(), user = db_user, password = db_password,
                  dbname = db_name, host = db_host, port = db_port)

```

The SQL statement below creates the *customers* and *export* tables used in the examples.

```{sql dropCustomersTable, connection=con, include=F, echo=F, eval=F}
DROP TABLE IF EXISTS customers;
```

```{sql dropExportTable, connection=con, include=F, echo=F, eval=F}
DROP TABLE IF EXISTS export;
```

```{sql createTable, connection=con, include=F, echo=T, eval=F}
CREATE TABLE IF NOT EXISTS customers 
(
  cid INTEGER PRIMARY KEY,
  name TEXT NOT NULL,
  age INTEGER NOT NULL,
  country TEXT NOT NULL,
  profession TEXT NOT NULL
)
```

```{sql createTable2, connection=con, include=F, echo=T, eval=F}
CREATE TABLE IF NOT EXISTS export 
(
  name TEXT NOT NULL,
  job TEXT NOT NULL,
  region TEXT NOT NULL
)
```

```{r addCSV2DB, eval=F, echo=F}
dbWriteTable(con, 
             name="customers", 
             value="customers.csv", 
             row.names=FALSE, 
             header=TRUE, 
             sep = ",", 
             overwrite = TRUE)
```

```{r loadDB, echo=F, eval=F}
df <- read.csv(file = "customers.csv",
               header = T,
               stringsAsFactors = F)

for (i in 1:1000) {
  r <- df[i,]
  
  sql <- paste0("INSERT INTO customers VALUES (",
                   r$cid, ",",
                   "'", r$name, "',",
                   r$age, ",",
                   "'", r$country, "',",
                   "'", r$profession, "')")
  dbSendStatement(conn = con, 
                  statement = sql)
}
```

```{sql connection=con, include=F, echo=F, eval=F}
select * from customers limit 6;
```

```{sql connection=con, include=F, echo=F, eval=F}
select count(*) from customers;
```

```{sql connection=con, include=F, echo=F, eval=F}
select * from export limit 6;
```

### Using Cursors from Stored Procedures

Cursors are most commonly employed within stored procedures. The example below defines a simple stored procedure that uses a cursor:

```{sql defSP, connection=con,eval=F}
CREATE PROCEDURE ExportDataProc()
   BEGIN
      DECLARE done INT DEFAULT 0;
      DECLARE custID INTEGER;
      DECLARE custProf, custOrigin VARCHAR(64);
      DECLARE aCursor CURSOR FOR SELECT cid, profession, country FROM customers;
      DECLARE CONTINUE HANDLER FOR NOT FOUND SET done = 1;
      OPEN aCursor;
      label: LOOP
      FETCH aCursor INTO custID, custProf, custOrigin;
      INSERT INTO export VALUES(custID, custProf, custOrigin);
      IF done = 1 THEN LEAVE label;
      END IF;
      END LOOP;
      CLOSE aCursor;
   END
```

To demonstrate the the cursor and stored procedure work, we will call the procedure from R:

```{r callSP, echo=T, eval=F}
status <- dbExecute(conn = con,
                    "CALL ExportDataProc();")
```

```{r discfromDB, echo=F, eval=F}
dbDisconnect(con)
```

## Summary

Cursors in relational databases allow a developer to linearly traverse the result set returned by a SQL SELECT query. They are useful when data needs to be processed row-by-row, and loading the entire result set into memory at once is not appropriate.

MySQL cursors are read-only, non-scrollable, and insensitive to changes made to the table while the cursor is active.

Developers should use cursors only when necessary and consider alternative approaches such as in-memory traversal.

------------------------------------------------------------------------

## Files & Resources {#files-resources}

```{r zipFiles, echo=FALSE}
zipName = sprintf("LessonFiles-%s-%s.zip", 
                 params$category,
                 params$number)

textALink = paste0("All Files for Lesson ", 
               params$category,".",params$number)

# downloadFilesLink() is included from _insert2DB.R
knitr::raw_html(downloadFilesLink(".", zipName, textALink))
```

------------------------------------------------------------------------

## See Also

-   [70.907 -- Stored Procedures in MySQL](http://artificium.us/lessons/70.sql/l-70-907-stored-procs-mysql/l-70-907.html)

## References

[TutorialsPoint: MySQL Cursor DECLARE Statement](https://www.tutorialspoint.com/mysql/mysql_cursor_declare_statement.htm)

## Errata

None collected yet. Let us know.

```{=html}
<script src="https://form.jotform.com/static/feedback2.js" type="text/javascript">
  new JotformFeedback({
    formId: "212187072784157",
    buttonText: "Feedback",
    base: "https://form.jotform.com/",
    background: "#F59202",
    fontColor: "#FFFFFF",
    buttonSide: "left",
    buttonAlign: "center",
    type: false,
    width: 700,
    height: 500,
    isCardForm: false
  });
</script>
```
```{r code=xfun::read_utf8(paste0(here::here(),'/R/_deployKnit.R')), include = FALSE}
```
