Creating Time Series Collections in MongoDB

This article is part of Robert Sheldon's continuing series on Mongo DB. To see all of the items in the series, click here.

Throughout this series, I’ve introduced you to different features in MongoDB and provided examples to help demonstrate how the database system works. The examples have all been based on conventional collections, the type that MongoDB creates by default. However, MongoDB also supports other types of collections, including the time series collection, which can benefit many of today’s event-driven workloads.

The documents in a time series collection represent a sequence of data points, with each document recording an event at a specific point of time. For example, a machine sensor might generate temperature readings that are transformed into individual documents and stored in a time series collection.

A time series collection is optimized to handle these types of documents and the workloads they support, offering improved query performance and reduced storage consumption. MongoDB automatically stores the collection’s data in groups of related documents and indexes them based on their date values and unique group identifiers.

In this article, I introduce you to the time series collection and demonstrate different ways you can create them in MongoDB Shell. If you want to try out the examples, you can use the version of Shell embedded in MongoDB Compass or the one you access through your system’s command-line interface. You can also create time series collections in the Compass GUI, although this article focuses on the Shell commands.

Note: For the examples in this article, I used the same MongoDB Atlas environment I used for the previous articles in this series. Refer to the first article for details about setting up these environments. For this article, the examples are based on the iot database, which you can create in advance or when you try out the exercises.

Adding a time series collection to a MongoDB database

To create a time series collection in MongoDB Shell, you can use the createCollection method, just like you can for a conventional collection. The primary difference is that, for a time series collection, you must include the timeseries option in your collection definition, as shown in the following syntax:

db.createCollection(

"collection_name",

{

timeseries:

{

timeField: "field_name",

metaField: "field_name",

granularity_options

expireAfterSeconds: num_seconds

}

);

The command’s syntax consists of the following elements:

db. System variable for referencing the current database and accessing the properties and methods available to the database object. For this article, you should ensure that iot is the current database.
createCollection. Database method for creating a collection in the current database.
collection_name. Placeholder for the name of the new collection. For this article, we’ll be creating the pressure collection.
timeseries. An option available to the createCollection method for creating a time series collection. The option defines an embedded document that includes parameters specific to a time series collection.
timeField. A timeseries parameter that specifies a date field in the collection’s documents. The field must be defined as a valid BSON data type. BSON is a binary encoding of JSON.
metaField. An optional timeseries parameter that specifies a metadata field in the collection’s documents. The field should contain data that can uniquely identify a related group of documents. For example, the field might identify a weather sensor and its location. Only documents that are generated by the same sensor at the same location can be included in the same bucket. The field value should rarely, if ever, change. Although this setting is optional, its inclusion can improve query performance because it can be used as part of a compound index along with the field assigned to the timeField parameter.
granularity_options. Placeholder for one or more parameters that specify the collection’s granularity, which determines how the collection’s documents are bucketed into related groups of data. I discuss the granularity options in more detail later in the article.
expireAfterSeconds. An optional parameter that lets you specify whether the documents in a time series collection should be automatically deleted after a certain amount of time. If the setting is included, it should be defined with an integer value that indicates when the documents will be deleted. The integer, as indicated by the num_seconds placeholder, determines the number of seconds that should pass before a document expires.

The documents in a time series collection typically contain a date field that is assigned to the timeField parameter, a metadata field that is assigned to the metaField parameter, and some type of measure specific to the date field and metadata field. For instance, the documents in a time series collection that tracks global temperatures might include the following three fields:

A date field that records when the temperature was measured.
A metadata field that identifies the weather sensor and its location.
A measure field that records the temperature.

Each document in a time series collection represents an event at a specific point in time, such as the weather station’s temperature readings. Other examples include website views, stock trades, inventory changes, sensor data from internet of things (IoT) devices, and a variety of other use cases. The key is to define your time series collections to meet the specific needs of your workloads. You’ll get a better sense of how each of these elements work as we progress through the article.

Creating a time series collection

Now that we’ve reviewed the syntax, let’s look at an example of how to create a time series collection. We’ll start with a basic collection that uses the default granularity and does not expire the documents.

You’ll be creating the collection in the iot database, so you’ll need to change the context to that database. To do so, launch MongoDB Shell and, at the command prompt, enter the following command:

use iot;

The command switches the shell’s database context to iot. You can use this command even if you have not yet created the database. If the database does not exist, MongoDB will automatically create it when you add the collection.

Once you’ve established the database context, you can use the createCollection method to add the pressure collection, as shown in the following command:

db.createCollection(

"pressure",

{

timeseries: {

timeField: "timestamp",

metaField: "source"

}

);

The timeseries element in the collection definition includes the following two parameters:

The timeField parameter specifies the timestamp field, which contains the document’s timestamp.
The metaField parameter specifies the source field, which is an embedded document that contains a system identifier and sensor identifier.

That’s all there is to creating a basic time series collection. The trick is to know in advance which fields you plan to assign to the timeField and metaField parameters. The fields will be specific to the documents you’ll be inserting into the collection.

When you run the createCollection command, MongoDB automatically creates a compound index on the fields specified in the timeField and metaField parameters. In this case, MongoDB creates the index on the source and timestamp fields, as indicated by the index name, source_1_timestamp_1.

After you create the collection, you can then run the following insertMany command to add sample data to the collection:

db.pressure.insertMany([

{

"timestamp": ISODate("2024-12-01T12:05:00.000Z"),