Author: Alexey Milovidov, 2018-11-26.
LowCardinality data types in production.
Choice of compression algorithm at individual column level.
Support for computed DEFAULT expressions
when importing JSONEachRow.
Support for Parquet format for import and export.
Import/export data to HDFS using table function.
Import/export data to S3 using table function.
Ability to add new columns
to the sorting key of MergeTree tables.
Reduction of metadata volume in ZooKeeper.
Ability to create dictionaries via DDL queries.
Adaptive index granularity in MergeTree tables.
Access restriction at the level of tables, columns and rows
(row-level security).
Role-based access control (RBAC).
Ability to connect an external system
for authentication (LDAP, Kerberos).
Configurable resource pools: CPU share, IO, Network, RAM.
Multiple JOINs without using nested subqueries.
Merge JOIN for joining very large sets.
Bucket-Shuffle JOIN for optimizing large distributed JOINs.
To be precise — index structures for data skipping.
min/max, distinct values, micro bloom-filter.
Ability to create and apply a model
directly in ClickHouse.
What is it?
SELECT * FROM sensors ORDER BY time DESC LIMIT 10
Functions for working with geohash.
Dictionaries of polygons for searching (region by location queries).
Min-hash algorithm for fuzzy search of near-duplicates.
Fast matching of a large number of substrings.
Ability to create additional structure to accelerate brute-force substring search in string.
Separation of hot and cold data on SSD and HDD.
Ability to use JBOD.
Getting rid of problems with frequent inserts.