Tiger Cloud: Performance, Scale, Enterprise

Self-hosted products

MST

Create a hypertable partitioned on a single dimension with columnstore enabled, or create a standard Postgres relational table.

A hypertable is a specialized Postgres table that automatically partitions your data by time. All actions that work on a Postgres table, work on hypertables. For example, ALTER TABLE and SELECT. By default, a hypertable is partitioned on the time dimension. To add secondary dimensions to a hypertable, call add_dimension. To convert an existing relational table into a hypertable, call create_hypertable.

As the data cools and becomes more suited for analytics, add a columnstore policy so your data is automatically converted to the columnstore after a specific time interval. This columnar format enables fast scanning and aggregation, optimizing performance for analytical workloads while also saving significant storage space. In the columnstore conversion, hypertable chunks are compressed by up to 98%, and organized for efficient, large-scale queries. This columnar format enables fast scanning and aggregation, optimizing performance for analytical workloads. You can also manually convert chunks in a hypertable to the columnstore.

Hypertable to hypertable foreign keys are not allowed, all other combinations are permitted.

The columnstore settings are applied on a per-chunk basis. You can change the settings by calling ALTER TABLE without first converting the entire hypertable back to the rowstore. The new settings apply only to the chunks that have not yet been converted to columnstore, the existing chunks in the columnstore do not change. Similarly, if you remove an existing columnstore policy and then add a new one, the new policy applies only to the unconverted chunks. This means that chunks with different columnstore settings can co-exist in the same hypertable.

TimescaleDB calculates default columnstore settings for each chunk when it is created. These settings apply to each chunk, and not the entire hypertable. To explicitly disable the defaults, set a setting to an empty string.

CREATE TABLE extends the standard Postgres CREATE TABLE. This page explains the features and arguments specific to TimescaleDB.

Since TimescaleDB v2.20.0
  • Create a hypertable partitioned on the time dimension and enable columnstore:

    1. Create the hypertable:

      CREATE TABLE crypto_ticks (
      "time" TIMESTAMPTZ,
      symbol TEXT,
      price DOUBLE PRECISION,
      day_volume NUMERIC
      ) WITH (
      tsdb.hypertable,
      tsdb.partition_column='time',
      tsdb.segmentby='symbol',
      tsdb.orderby='time DESC'
      );
    2. Enable hypercore by adding a columnstore policy:

      CALL add_columnstore_policy('crypto_ticks', after => INTERVAL '1d');
  • Create a hypertable partitioned on the time with fewer chunks based on time interval:

    CREATE TABLE IF NOT EXISTS hypertable_control_chunk_interval(
    time int4 NOT NULL,
    device text,
    value float
    ) WITH (
    tsdb.hypertable,
    tsdb.partition_column='time',
    tsdb.chunk_interval=3453
    );
  • Enable data compression during ingestion:

    When you set timescaledb.enable_direct_compress_copy your data gets compressed in memory during ingestion with COPY statements. By writing the compressed batches immediately in the columnstore, the IO footprint is significantly lower. Also, the columnstore policy you set is less important, INSERT already produces compressed chunks.

    Note

    Please note that this feature is a tech preview and not production-ready. Using this feature could lead to regressed query performance and/or storage ratio, if the ingested batches are not correctly ordered or are of too high cardinality.

    To enable in-memory data compression during ingestion:

    SET timescaledb.enable_direct_compress_copy=on;

    Important facts

    • High cardinality use cases do not produce good batches and lead to degreaded query performance.
    • The columnstore is optimized to store 1000 records per batch, which is the optimal format for ingestion per segment by.
    • WAL records are written for the compressed batches rather than the individual tuples.
    • Currently only COPY is support, INSERT will eventually follow.
    • Best results are achieved for batch ingestion with 1000 records or more, upper boundary is 10.000 records.
    • Continous Aggregates are not supported at the moment.
    1. Create a hypertable:
      CREATE TABLE t(time timestamptz, device text, value float) WITH (tsdb.hypertable,tsdb.partition_column='time');
    2. Copy data into the hypertable: You achieve the highest insert rate using binary format. CSV and text format are also supported.
      COPY t FROM '/tmp/t.binary' WITH (format binary);
  • Create a Postgres relational table:
    CREATE TABLE IF NOT EXISTS relational_table(
    device text,
    value float
    );

The syntax is:

CREATE TABLE <table_name> (
-- Standard Postgres syntax for CREATE TABLE
)
WITH (
tsdb.hypertable = true | false
tsdb.partition_column = '<column_name> ',
tsdb.chunk_interval = '<interval>'
tsdb.create_default_indexes = true | false
tsdb.associated_schema = '<schema_name>',
tsdb.associated_table_prefix = '<prefix>'
tsdb.orderby = '<column_name> [ASC | DESC] [ NULLS { FIRST | LAST } ] [, ...]',
tsdb.segmentby = '<column_name> [, ...]',
tsdb.index = '<index>(<column_name>), index(<column_name>)'
)
NameTypeDefaultRequiredDescription
tsdb.hypertableBOOLEANtrueCreate a new hypertable for time-series data rather than a standard Postgres relational table.
tsdb.partition_columnTEXTtrueSet the time column to automatically partition your time-series data by.
tsdb.chunk_intervalTEXT7 daysChange this to better suit your needs. For example, if you set chunk_interval to 1 day, each chunk stores data from the same day. Data from different days is stored in different chunks.
tsdb.create_default_indexesBOOLEANtrueSet to false to not automatically create indexes.
The default indexes are:
  • On all hypertables, a descending index on partition_column
  • On hypertables with space partitions, an index on the space parameter and partition_column
tsdb.associated_schemaREGCLASS_timescaledb_internalSet the schema name for internal hypertable tables.
tsdb.associated_table_prefixTEXT_hyperSet the prefix for the names of internal hypertable chunks.
tsdb.orderbyTEXTDescending order on the time column in table_name.The order in which items are used in the columnstore. Specified in the same way as an ORDER BY clause in a SELECT query. Setting tsdb.orderby automatically creates an implicit min/max sparse index on the orderby column.
tsdb.segmentbyTEXTTimescaleDB looks at pg_stats and determines an appropriate column based on the data cardinality and distribution. If pg_stats is not available, TimescaleDB looks for an appropriate column from the existing indexes.Set the list of columns used to segment data in the columnstore for table. An identifier representing the source of the data such as device_id or tags_id is usually a good candidate.
tsdb.indexTEXTTimescaleDB evaluates the columns you already have indexed, checks which data types are a good fit for sparse indexing, then creates a sparse index as an optimization.Configure the sparse indexes for compressed chunks. Requires setting tsdb.orderby. Supported index types include:
  • bloom(<column_name>): a probabilistic index, effective for = filters. Cannot be applied to tsdb.orderby columns.
  • minmax(<column_name>): stores min/max values for each compressed chunk. Setting tsdb.orderby automatically creates an implicit min/max sparse index on the orderby column.
  • Define multiple indexes using a comma-separated list. You can set only one index per column. Set to an empty string to avoid using sparse indexes and explicitly disable the default behavior.

    TimescaleDB returns a simple message indicating success or failure.

    Keywords

    Found an issue on this page?Report an issue or Edit this page in GitHub.