Introduction
In this lesson, you will learn about the SQLite AUTOINCREMENT column attribute for specifying primary key values that are assigned by the database and do not have to be specified explicitly. While convenient, they can slow insertion performance. We recommend that you avoid using AUTOINCREMENT. Nevertheless, this tutorial will explain how to use the keyword.
Creating Tables
create table lessons (
lname text not null,
llength integer not null default 0
);
Let’s add a few rows of data to the table so we can see how SQLite assigns a row id.
insert into lessons values
('Intro C++', 90),
('Intro Java', 90),
('SQL Joins', 60)
Row IDs as Implicit Primary Keys
When creating a table, SQLite adds an automatic column called rowid (also accessible by the aliases _rowid_ and oid). Thus, there is no requirement to have an explicitly specified primary key column in SQLite.
Let’s select the “hidden” column rowid. Of course, we should not rely on the fact that the row id starts at 1 and the id’s are in sequence. We can, however, rely on them being unique integers. The row id is a 64-bit integer, so the maximum value is 264−1.
select rowid, lname, llength from lessons;
Table 1: 3 records
1 |
Intro C++ |
90 |
2 |
Intro Java |
90 |
3 |
SQL Joins |
60 |
To create a table without a row id added, you need to add a the WITHOUT ROWID option. If we do that, then we must specify a primary key column explicitly. Of course, there is no need to specify the primary key column as NOT NULL as that is implied.
Note that the WITHOUT ROWID option is only available in SQLite 3.8.2 or later.
create table lessons (
lid integer primary key,
lname text not null,
llength integer not null default 0
) without rowid;
Alternatively, you can specify the primary key column with the primary key constraint. This is necessary of the primary key consists of more than one column.
create table lessons (
lid integer,
lname text not null,
llength integer not null default 0,
primary key (lid)
) without rowid;
When we specify WITHOUT ROWID, then we do not have an implicit primary key through the hidden rowid column and consequently we need to assign unique primary key values when inserting rows. Primary keys are often integers but do not have to be and their values do not have to be sequential. Attempting to insert a duplicate primary key value would naturally result in the error “UNIQUE constraint failed”.
insert into lessons values
(101, 'Intro C++', 90),
(102, 'Intro Java', 90),
(605, 'SQL Joins', 60)
And here is the data:
Table 2: 3 records
101 |
Intro C++ |
90 |
102 |
Intro Java |
90 |
605 |
SQL Joins |
60 |
Automatic Primary Keys with AUTOINCREMENT
Another alternative to synthetic primary keys and row id’s is the AUTOINCREMENT attribute for primary key columns where SQLite generates a unique and increasing value for a primary key. The value is generated and SQLite only guarantees that is is larger than any prior value but does not guarantee sequentially increasing values – particularly in concurrent situations.
The table definition below specifies a primary key with an automatic primary key value.
create table lessons (
lid integer primary key autoincrement,
lname text not null,
llength integer not null default 0
);
When a primary key is auto incremented then we do not have to specify a value when inserting a new row, although we can. Note that you must specify the column names explicitly when using auto incremented primary keys, e.g., the (lname, llength) after insert into lessons and before values is essential – omitting it will result in a column mismatch error.
insert into lessons (lname, llength) values
('Intro C++', 90),
('Intro Java', 90),
('SQL Joins', 60)
Let’s retrieve the data and see the primary key values.
Table 3: 3 records
1 |
Intro C++ |
90 |
2 |
Intro Java |
90 |
3 |
SQL Joins |
60 |
Of course, you can still specify a primary key value explicitly:
insert into lessons (lid, lname, llength) values
(399, 'Intro Python', 90)
But notice the next value of the auto increment after you specified a primary key.
insert into lessons (lname, llength) values
('Intro R', 65)
The value is now incremented from the largest value (to avoid possible duplication).
Table 4: 5 records
1 |
Intro C++ |
90 |
2 |
Intro Java |
90 |
3 |
SQL Joins |
60 |
399 |
Intro Python |
90 |
400 |
Intro R |
65 |
Note that AUTOINCREMENT does not reuse values (such as when a row is deleted) and thus it is possible to eventually exceed the maximum value and get an (erroneous) “disk full” error. On the other hand, ROWID does reuse values and thus is less likely to run out of disk space.
Conclusion
Assigning primary key values using auto increment or using a row id is convenient but can result in slower performance and issues in connecting tables. While often useful, both should generally be avoided in favor of application generated primary keys.
Since row id is potentially slow in generating a new key, tables should be created without a row id and either use an application generated primary key or an auto incremented value.
Errata
None collected yet. Let us know.
---
title: "Primary Keys with AUTOINCREMENT and ROWID in SQLite"
params:
  category: 70
  number: 809
  time: 30
  level: beginner
  tags: "sqlite,create table,auto increment"
  description: "Explains the use and specification of auto incremented primary
                keys in SQLite."
date: "<small>`r Sys.Date()`</small>"
author: "<small>Martin Schedlbauer</small>"
email: "m.schedlbauer@neu.edu"
affilitation: "Northeastern University"
output: 
  bookdown::html_document2:
    toc: true
    toc_float: true
    collapsed: false
    number_sections: false
    code_download: true
    theme: spacelab
    highlight: tango
---

---
title: "<small>`r params$category`.`r params$number`</small><br/><span style='color: #2E4053; font-size: 0.9em'>`r rmarkdown::metadata$title`</span>"
---

```{r code=xfun::read_utf8(paste0(here::here(),'/R/_insert2DB.R')), include = FALSE}
```

## Introduction

In this lesson, you will learn about the SQLite *AUTOINCREMENT* column attribute for specifying primary key values that are assigned by the database and do not have to be specified explicitly. While convenient, they can slow insertion performance. We recommend that you avoid using AUTOINCREMENT. Nevertheless, this tutorial will explain how to use the keyword.

## Creating Tables

```{r createDB, echo=F}
# change this code to connect to your specific database
library(RSQLite)
dbcon <- dbConnect(RSQLite::SQLite(), ":memory:")
```

```{sql connection=dbcon, echo=F}
drop table if exists lessons;
```

```{sql connection=dbcon}
create table lessons (
  lname text not null,
  llength integer not null default 0
);
```

Let's add a few rows of data to the table so we can see how SQLite assigns a row id.

```{sql connection=dbcon}
insert into lessons values
  ('Intro C++', 90),
  ('Intro Java', 90),
  ('SQL Joins', 60)
```

## Row IDs as Implicit Primary Keys

When creating a table, SQLite adds an automatic column called *rowid* (also accessible by the aliases *\_rowid\_* and *oid*). Thus, there is no requirement to have an explicitly specified primary key column in SQLite.

Let's select the "hidden" column *rowid*. Of course, we should not rely on the fact that the row id starts at 1 and the id's are in sequence. We can, however, rely on them being unique integers. The row id is a 64-bit integer, so the maximum value is $2^{64}-1$.

```{sql connection=dbcon}
select rowid, lname, llength from lessons;
```

To create a table without a row id added, you need to add a the *WITHOUT ROWID* option. If we do that, then we must specify a primary key column explicitly. Of course, there is no need to specify the primary key column as *NOT NULL* as that is implied.

Note that the *WITHOUT ROWID* option is only available in SQLite 3.8.2 or later.

```{sql connection=dbcon, echo=F}
drop table if exists lessons;
```

```{sql connection=dbcon}
create table lessons (
  lid integer primary key,
  lname text not null,
  llength integer not null default 0
) without rowid;
```

Alternatively, you can specify the primary key column with the *primary key* constraint. This is necessary of the primary key consists of more than one column.

```{sql connection=dbcon, eval=F}
create table lessons (
  lid integer,
  lname text not null,
  llength integer not null default 0,
  primary key (lid)
) without rowid;
```

When we specify *WITHOUT ROWID*, then we do not have an implicit primary key through the hidden *rowid* column and consequently we need to assign unique primary key values when inserting rows. Primary keys are often integers but do not have to be and their values do not have to be sequential. Attempting to insert a duplicate primary key value would naturally result in the error "UNIQUE constraint failed".

```{sql connection=dbcon}
insert into lessons values
  (101, 'Intro C++', 90),
  (102, 'Intro Java', 90),
  (605, 'SQL Joins', 60)
```

And here is the data:

```{sql connection=dbcon}
select * from lessons;
```

## Automatic Primary Keys with AUTOINCREMENT

Another alternative to synthetic primary keys and row id's is the *AUTOINCREMENT* attribute for primary key columns where SQLite generates a unique and increasing value for a primary key. The value is generated and SQLite only guarantees that is is larger than any prior value but does not guarantee sequentially increasing values -- particularly in concurrent situations.

The table definition below specifies a primary key with an automatic primary key value.

```{sql connection=dbcon, echo=F}
drop table if exists lessons;
```

```{sql connection=dbcon}
create table lessons (
  lid integer primary key autoincrement,
  lname text not null,
  llength integer not null default 0
);
```

When a primary key is auto incremented then we do not have to specify a value when inserting a new row, although we can. Note that you must specify the column names explicitly when using auto incremented primary keys, e.g., the *(lname, llength)* after *insert into lessons* and before *values* is essential -- omitting it will result in a column mismatch error.

```{sql connection=dbcon}
insert into lessons (lname, llength) values 
  ('Intro C++', 90),
  ('Intro Java', 90),
  ('SQL Joins', 60)
```

Let's retrieve the data and see the primary key values.

```{sql connection=dbcon}
select * from lessons;
```

Of course, you can still specify a primary key value explicitly:

```{sql connection=dbcon}
insert into lessons (lid, lname, llength) values 
  (399, 'Intro Python', 90)
```

But notice the next value of the auto increment after you specified a primary key.

```{sql connection=dbcon}
insert into lessons (lname, llength) values 
  ('Intro R', 65)
```

The value is now incremented from the largest value (to avoid possible duplication).

```{sql connection=dbcon}
select * from lessons;
```

Note that *AUTOINCREMENT* does not reuse values (such as when a row is deleted) and thus it is possible to eventually exceed the maximum value and get an (erroneous) "disk full" error. On the other hand, *ROWID* does reuse values and thus is less likely to run out of disk space.

## Conclusion

Assigning primary key values using auto increment or using a row id is convenient but can result in slower performance and issues in connecting tables. While often useful, both should generally be avoided in favor of application generated primary keys.

Since row id is potentially slow in generating a new key, tables should be created without a row id and either use an application generated primary key or an auto incremented value.

------------------------------------------------------------------------

## Files & Resources

```{r zipFiles, echo=FALSE}
zipName = sprintf("LessonFiles-%s-%s.zip", 
                 params$category,
                 params$number)

textALink = paste0("All Files for Lesson ", 
               params$category,".",params$number)

# downloadFilesLink() is included from _insert2DB.R
knitr::raw_html(downloadFilesLink(".", zipName, textALink))
```

------------------------------------------------------------------------

## References

[SQLite AUTOINCREMENT. SQLite Tutorial.](https://www.sqlitetutorial.net/sqlite-autoincrement/)

[SQLite AUTOINCREMENT. SQLite Documentation.](https://www.sqlite.org/autoinc.html)

## Errata

None collected yet. Let us know.

```{=html}
<script src="https://form.jotform.com/static/feedback2.js" type="text/javascript">
  new JotformFeedback({
    formId: "212187072784157",
    buttonText: "Feedback",
    base: "https://form.jotform.com/",
    background: "#F59202",
    fontColor: "#FFFFFF",
    buttonSide: "left",
    buttonAlign: "center",
    type: false,
    width: 700,
    height: 500,
    isCardForm: false
  });
</script>
```
```{r code=xfun::read_utf8(paste0(here::here(),'/R/_deployKnit.R')), include = FALSE}
```
