StarRocks version 3.5
Upgrade Notes
-
JDK 17 or later is required from StarRocks v3.5.0 onwards.
- To upgrade a cluster from v3.4 or earlier, you must upgrade the version of JDK that StarRocks depends, and remove the options that are incompatible with JDK 17 in the configuration item
JAVA_OPTSin the FE configuration file fe.conf, for example, options that involve CMS and GC. The default value ofJAVA_OPTSin the v3.5 configuration file is recommended. - For clusters using external catalogs, you need to add
--add-opens=java.base/java.util=ALL-UNNAMEDto theJAVA_OPTSconfiguration item in the BE configuration file be.conf. - For clusters using Java UDFs, you need to add
--add-opens=java.base/java.nio=ALL-UNNAMED --add-opens=java.base/sun.nio.ch=ALL-UNNAMEDto theJAVA_OPTSconfiguration item in the BE configuration file be.conf. - In addition, as of v3.5.0, StarRocks no longer provides JVM configurations for specific JDK versions. All versions of JDK use
JAVA_OPTS.
- To upgrade a cluster from v3.4 or earlier, you must upgrade the version of JDK that StarRocks depends, and remove the options that are incompatible with JDK 17 in the configuration item
-
It is recommended to upgrade the cluster to v3.4.10 or later before upgrading it to v3.5. Otherwise, you must manually disable low cardinality optimization during the gray-scale upgrade by executing the following statement:
SET GLOBAL cbo_enable_low_cardinality_optimize=false;
Downgrade Notes
- After upgrading StarRocks to v3.5, DO NOT downgrade it directly to v3.4.0 ~ v3.4.5, otherwise it will cause metadata incompatibility. You must downgrade the cluster to v3.4.6 or later to prevent the issue.
- After upgrading StarRocks to v3.5.2 or later, DO NOT downgrade it to v3.5.0 & v3.5.1, otherwise it will cause FE crash.
3.5.11​
Release date: January 5, 2026
Improvements​
- Supports Arrow Flight data retrieval from inaccessible nodes. #66348
- Logs the cause (including the triggering process information) in the SIGTERM handler. #66737
- Added an FE configuration
enable_statistic_collect_on_updateto control whether UPDATE statements can trigger automatic statistics collection. #66794 - Supports configuring
networkaddress.cache.ttl. #66723 - Improve the “no rows imported” error message. #66624 #66535
- Optimized
deltaRowswith lazy evaluation for large partition tables. #66381 - Optimized materialized view rewrite performance. #66623
- Supports single-tablet
ResultSinkoptimization in shared-data clusters. #66517 rewrite``_``simple``_``agg``_``to``_``meta``_``scanis enabled by default. #64698- Supports pushing down GROUP BY expressions and materialized view rewrite. #66507
- Add overloaded
newMessagemethods to improve materialized view logs. #66367
Bug Fixes​
The following issues have been fixed:
- A Publish Compaction crash when the input rowset is not found. #67154
- Significant CPU overhead and lock contention caused by repetitive invocation of
update_segment_cache_sizewhen querying tables with a large number of columns. #66714 MulticastSinkOperatorstuck in theOUTPUT_FULLstate. #67153- A “column not found” issue in the skew join hint. #66929
- The growth of all tablets continues unabated, and the sum of pending and running tablets is not the total number of tablets. #66718
- Transactions in the Compaction map built during Leader startup cannot be accessed by CompactionScheduler and will never be removed from the map. #66533
- Delta Lake table refresh does not take effect. #67156
- CN crash at queries against non-partitioned Iceberg tables with DATE predicates. #66864
- Statements in Profiles cannot be correctly displayed when multiple statements are submitted. #67097
- Missing dictionary information during collection because Meta Reader does not support reading from Delta column group files. #66995
- Potential Java heap OOM in Java UDAF. #67025
- BE crash due to the incorrect logic of ranking window optimization without PARTITION BY and ORDER BY. #67081
- Misleading log level for timezone cache miss. #66817
- Crash and incorrect results caused by the incorrect
can_use_bfchecking when merging runtime filters. #67021 - Issue about pushing down runtime bitset filter with other OR predicates. #66996
- Patch critical fix from lz4. #67053
- AsyncTaskQueue deadlock issue. #66791
- Cache inconsistency in ObjectColumn. #66957
- RewriteUnnestBitmapRule causes wrong output column types. #66855
- Data races and data loss when there are WRITE or FLUSH tasks after FINISH tasks in the Delta Writer. #66943
- Invalid load channel and misleading internal errors caused by reopened load channels that were previously aborted. #66793
- Bugs of Arrow Flight SQL. #65889
- Issues when querying renamed columns with MetaScan. #66819
- Hash column is not removed before flushing chunk in partitionwise spillable aggregation when skew elimination is off. #66839
- BOOLEAN type default values were not correctly handled when stored as string literals. #66818
- decimal2decimal cast unexpectedly returns the input column as the result directly. #66773
- NPE in query planning during schema change. #66811
- LocalTabletsChannel and LakeTabletsChannel deadlock. #66748
publish_versionlog shows emptytxn_idswith new FE. #66732- Incorrect behavior of the FE configuration
statistic_collect_query_timeout. #66363 - UPDATE statements do not support statistics collection. #66443
- Case rewrite errors related to low cardinality. #66724
- Statistics query failure when the column list is empty. #66138
- Usage/record mismatch when switching warehouse via hint. #66677
ANALYZE TABLEstatements lack ExecTimeout. #66361array_mapreturns wrong results from constant unary expressions. #66514- Foreign key constraints are lost after FE restart. #66474
max(not null string)on empty table throwsstd::length_error. #66554- Concurrency issue between Primary Key index Compaction and Apply. #66282
- Improper behavior of
EXPLAIN <query>. #66542 - Issue when sinking DECIMAL128 to Iceberg table column. #66071
- JSON length check issue for JSON → CHAR/VARCHAR when the target length equals the minimum. #66628
- An expression children count error. #66511
3.5.10​
Release date: December 15, 2025
Improvements​
- Supports dumping plan node IDs in BE crash logs to speed up locating problematic operators. #66454
- Optimized scans on the system views in
information_schemato reduce the overhead. #66200 - Added two histogram metrics (
slow_lock_held_time_msandslow_lock_wait_time_ms) to provide better observability for slow lock scenarios and distinguish between long-held locks and high lock contention. #66027 - Optimized replica lock handling in tablet report and clone flows by switching the lock from database level to table level, reducing lock contention and improving scheduling efficiency. #61939
- Avoided outputting columns in BE storage, and pushed down predicate computation to BE storage. #60462
- Improved query profile accuracy when deploying scan ranges in background threads. #62223
- Improved profile accounting when deploying additional tasks, so CPU time is not repetitively counted. #62186
- Added more detailed error messages when a referenced partition does not exist, making failures easier to diagnose. #65674
- Made sample-type cardinality estimation more robust in corner cases to improve row-count estimates. #65599
- Added a partition filter when loading statistics to prevent INSERT OVERWRITE from reading stale partition statistics. #65578
- Splited pipeline CPU
execution_timemetrics into separate series for queries and loads, improving observability by workload type. #65535 - Supported
enable_statistic_collect_on_first_loadat table granularity for finer-grained control over statistics collection on the first load. #65463 - Renamed the S3-dependent unit test from
PocoClientTestto an S3-specific name to better reflect its dependency and intent. #65524
Bug Fixes​
The following issues have been fixed:
- libhdfs crashes when StarRocks is started with an incompatible JDK. #65882
- Incorrect query results caused by
PartitionColumnMinMaxRewriteRule. #66356 - Rewrite issues due to the materialized view metadata not refreshed when resolving materialized views by AST keys. #66472
- The trim function crashes or produces wrong results when trimming specific Unicode whitespace characters. #66428, #66477
- Failures in load metadata and SQL execution that still referenced a deleted warehouse. #66436
- Wrong results when group execution Join is combined with window functions. #66441
- A possible FE null pointer in
resetDecommStatForSingleReplicaTabletUnlocked. #66034 - Missing Join runtime filter pushdown optimization in shared-data clusters for
LakeDataSource. #66354 - Parameters are inconsistent for runtime filter transmit options (timeouts, HTTP RPC limits, etc.) because they are not forwarded to receivers. #66393
- Automatic partition creation fails when partition values already exist. #66167
- Inaccurate scan statistics in audit logs when predicates have high selectivity. #66280
- Incorrect query results because non-deterministic functions are pushed below operators. #66323
- Exponential growth in the number of expressions caused by CASE WHEN. #66324
- Materialized view compensation bugs when the same table appears multiple times in a query with different partition predicates. #66369
- BE becomes unresponsive when using fork in subprocesses. #66334
- CVE-2025-66566 and CVE-2025-12183. #66453, #66362
- Errors caused by nested CTE reuse. #65800
- Issues due to the lack of validation on conflicting schema-change clauses. #66208
- Improper rowset GC behavior when rowset commit fails. #66301
- A potential use-after-free when counting down pipelines. #65940
- The
W``arehousefield is NULL ininformation_schema.loadsfor Stream Load. #66202 - Issues with materialized view creation when the referenced view has the same name as its base table. #66274
- The global dictionary is not updated correctly under some cases. #66194
- Incorrect query profile logging for queries forwarded from Follower nodes. #64395
- BE crash when caching SELECT results and reordering schema. #65850
- Shadow partitions are dropped when dropping partitions by expression. #66171
- DROP tasks run when a CLONE task exists for the same tablet. #65780
- Stability and observability issues because RocksDB log file options were not properly set. #66166
- Incorrect materialized view compensation that could produce NULL results. #66216
- BE reported as alive even after receiving
SIGSEGV. #66212 - Bugs in Iceberg scans. #65658
- Regression coverage and stability issues for Iceberg view SQL test cases. #66126
- Unexpected behavior because
set_collectoris invoked repetitively. #66199 - Ingestion failures when column-mode partial updates are used together with conditional updates. #66139
- Temporary partition value conflicts under concurrent transactions. #66025
- An Iceberg table cache bug where Guava LocalCache could retain stale entries even when
cache.size() == 0, causing refresh to be ineffective and queries to return outdated tables. #65917 - Incorrect format placeholder in
LargeInPredicateException, causing the actual number of LargeInPredicate occurrences to be incorrectly reported in the error message. #66152 - NullPointerException in
ConnectScheduler’stimeout checker when connectContext is null. #66136 - Crashes caused by unhandled exceptions thrown from threadpool tasks. #66114
- Data loss when pushing down DISTINCT LIMIT in certain plans. #66109
multi_distinct_countnot updatingdistinct_sizeafter the underlying hash set is converted to a two-level hash set, which could lead to incorrect distinct counts. #65916- A race condition when an exec group submits the next driver which could trigger
Check failed: !driver->is_in_blocked()and abort the BE process. #66099 - INSERT failures when running ALTER TABLE ADD COLUMN with a default value concurrently with INSERT, due to mismatched types for the newly added column’s default expression. #65968
- An issue where
MemoryScratchSinkOperatorcould remain inpending_finishafterRecordBatchQueuewas shut down when SparkSQL exited early, causing the pipeline to hang. #66041 - A core dump when reading Parquet files that contain empty row groups. #65928
- Recursive calls and potential stack overflow at high DOP because the event scheduler’s readiness check is complicated. #66016
- Asynchronous materialized view refresh skips updates when the Iceberg base table contains expired snapshots. #65969
- Potential issues in predicate reuse and rewrite because the optimizer relies solely on hashCode to distinguish differences in predicates. #65999
- In the refresh of an asynchronous materialized view with multi-level partitioned base tables, only the parent partition metadata was checked, while sub-partition updates were skipped. #65596
- Statistics collection issues where AVG(ARRAY_LENGTH(...)) could return NULL for empty result sets. #65788
- Runtime profile counters are not correctly updating or clearing their min/max values during incremental updates on both BE and FE. #65869
- Incorrect logic to obtain the image journal ID when creating a cluster snapshot to ensure the snapshot uses the correct log position. #65970
- Results are misreported when the cleanup fails due to incorrect status checking logic in file cleanup error handling. #65709
- A possible infinite loop in certain plans isVariable() in DictMappingOperator. #65743
- Failures and missing audit/profile data because ConnectContext is not passed into scan-range deployment threads. #63544
- A use-after-free issue in the local Primary Key index manager when the storage engine is stopped. #65534
- Statistics collection issues for INSERT OVERWRITE with dynamic overwrite. #65657
- Concurrency issues caused by coarse-grained locks in DiskAndTabletLoadReBalancer. #65557
- Slow locks cannot be detected and reported correctly for the lack of slow-lock detection for critical locks. #65559
- NullPointerException when replaying upsert transaction state after the target database has been dropped. #65595
- Stale statistics were used because outdated partition statistics are not dropped after statistics collection triggered by INSERT OVERWRITE. #65586
- Data race in partition ID allocation that could lead to ID conflicts under concurrency. #65608
- Missing tablet IDs when retrieving initial tablet metadata. #65550
- Incorrect record information for PREPARE/EXECUTE statements in audit and profile logs. #65448
- Potential crashes because the non–thread-safe has_output function is called from multiple threads. #65514
- MemTable finalize tasks cannot be properly tracked because the
memtable_finalize_task_totalcounter metric is lacking. #65548 - Query ID collisions in Arrow Flight, causing multiple queries no longer share the same query ID. #65558
- Lock conflicts for
TabletChecker.doCheck()with other operations. #65237 - Scan behavior is inconsistent between shared-data and shared-nothing clusters, causing query semantics to differ. #61100
3.5.9​
Release date: November 26, 2025
Improvements​
- Added transaction latency metrics to FE for observing timing across transaction stages. #64948
- Supports overwriting S3 unpartitioned Hive tables to simplify full-table rewrites in data lake scenarios. #65340
- Introduced CacheOptions to provide finer-grained control over tablet metadata caching. #65222
- Supports sample statistics collection for INSERT OVERWRITE to ensure statistics stay consistent with the latest data. #65363
- Optimized the statistics collection strategy after INSERT OVERWRITE to avoid missing or incorrect statistics due to asynchronous tablet reports. #65327
- Introduced a retention period for partitions dropped or replaced by INSERT OVERWRITE or materialized view refresh operations, keeping them in the recycle bin for a while to improve recoverability. #64779
Bug Fixes​
The following issues have been fixed:
- Lock contention and concurrency issues related to
LocalMetastore.truncateTable(). #65191 - Lock contention and replica check performance issues related to TabletChecker. #65312
- Incorrect error logging when changing user via HTTP SQL. #65371
- Checksum failures caused by DelVec CRC32 upgrade compatibility issues. #65442
- Tablet metadata load failures caused by RocksDB iteration timeout. #65146
- When the internal
flat_pathstring is empty because the JSON hyper extraction path is$or all paths are skipped, callingsubstrwill throw an exception and cause BE crash. #65260 - The PREPARED flag in fragment execution is not correctly set. #65423
- Inaccurate write and flush metrics caused by duplicated load profile counters. #65252
- When multiple HTTP requests reuse the same TCP connection, if a non‑ExecuteSQL request arrives after an ExecuteSQL request, the
HttpConnectContextcannot be unregistered at channel close, causing HTTP context leaks. #65203 - MySQL 8.0 schema introspection errors (Fixed by adding session variables
default_authentication_pluginandauthentication_policy). #65330 - SHOW ANALYZE STATUS errors caused by unnecessary statistics collection for temporary partitions created after partition overwrite operations. #65298
- Global Runtime Filter race in the Event Scheduler. #65200
- Data Cache is aggressively disabled because the minimum Data Cache disk size constraint is too large. #64909
- An aarch64 build issue related to the
goldlinker automatic fallback. #65156
3.5.8​
Release date: November 10, 2025
Improvements​
- Upgraded Arrow to 19.0.1 to support the Parquet legacy list to include nested, complex files. #64238
- FILES() supports legacy Parquet LIST encodings. #64160
- Automatically determine the Partial Update mode based on the session variable and the number of inserted columns. #62091
- Applied low-cardinality optimization on analytic operators above table functions. #63378
- Added configurable table lock timeout to
finishTransactionto avoid blocking. #63981 - Shared-data clusters support table-level scan metrics attribution. #62832
- Window functions LEAD/LAG/FIRST_VALUE/LAST_VALUE now accept ARRAY type arguments. #63547
- Supports constant folding for several array functions to improve predicate pushdown and join simplification. #63692
- Supports batched API to optimize
tabletNumretrieval for a given node viaSHOW PROC /backends/{id}. Added an FE configuration itemenable_collect_tablet_num_in_show_proc_backend_disk_path(Default:true). #64013 - Ensured
INSERT ... SELECTreads the freshest metadata by refreshing external tables before planning. #64026 - Added
capacity_limit_reachedchecks to table functions, NL-join probe, and hash-join probe to avoid constructing overflowing columns. #64009 - Added FE configuration item
collect_stats_io_tasks_per_connector_operator(Default:4) for setting the maximum number of tasks to collect statistics for external tables. #64016 - Updated the default partition size for sample collection from 1000 to 300. #64022
- Increased lock table slots to 256 and added
ridto slow-lock logs. #63945 - Improved robustness of Gson deserialization in the presence of legacy data. #63555
- Reduced metadata lock scope for FILES() schema pushdown to cut lock contention and planning latency. #63796
- Added Task Run execute timeout checker by introducing an FE configuration item
task_runs_timeout_second, and refined cancellation logics for overdue runs. #63842 - Ensured
REFRESH MATERIALIZED VIEW ... FORCEalways refreshes target partitions (even in inconsistent or corrupted cases). #63844
Bug Fixes​
The following issues have been fixed:
- An exception when parsing the Nullable (Decimal) type of ClickHouse. #64195
- An issue with tablet migration and Primary Key index lookup concurrency. #64164
- Lack of FINISHED status in materialized view refresh. #64191
- Schema Change Publish does not retry in shared-data clusters. #64093
- Wrong row count statistics on Primary Key tables in Data Lake. #64007
- When tablet creation times out in shared-data clusters, node information cannot be returned. #63963
- Corrupted Lake DataCache cannot be cleared. #63182
- Window function with IGNORE NULLS flags can not be consolidated with its counterpart without iIGNORE NULLS flag. #63958
- Table compaction cannot be scheduled again after FE restart if the compaction was previously aborted. #63881
- Tasks fail to be scheduled if FE restarts frequently. #63966
- An issue with GCS error codes. #64066
- Instability issue with StarMgr gRPC executor. #63828
- Deadlock when creating an exclusive work group. #63893
- Cache for Iceberg tables is not properly invalidated. #63971
- Wrong results for sorted aggregation in shared-data clusters. #63849
- ASAN error in
PartitionedSpillerWriter::_remove_partition. #63903 - BE crash when failing to get splits from morsel queue. #62753
- A bug with aggregate push-down type cast in materialized view rewrite. #63875
- NPE when removing expired load jobs in FE. #63820
- Partitioned Spill crash when removing partitions. #63825
- Materialized view rewrite throws
IllegalStateExceptionunder certain plans. #63655 - NPE when creating a partitioned materialized view. #63830
3.5.7​
Release date: October 21, 2025
Improvements​
- Improved memory statistics accuracy for Scan operators by introducing retry backoff under heavy memory contention scenarios. #63788
- Optimized materialized view bucketing inference by leveraging existing tablet distribution to prevent excessive bucket creation. #63367
- Revised the Iceberg table caching mechanism to enhance consistency and reduce cache invalidation risks during frequent metadata updates. #63388
- Added the
querySourcefield toQueryDetailandAuditEventfor better traceability of query origins across APIs and schedulers. #63480 - Enhanced Persistent Index diagnostics by printing detailed context when duplicate keys are detected in MemTable writes. #63560
- Reduced lock contention in materialized view operations by refining lock granularity and sequencing in concurrent scenarios. #63481
Bug Fixes​
The following issues have been fixed:
- Materialized view rewrite failures caused by type mismatch. #63659
regexp_extract_allhas wrong behavior and lacks support forpos=0. #63626- Degraded scan performance caused by the profitless simplification of CASE WHEN with complex functions. #63732
- Incorrect DCG data reading when partial updates switch from column mode to row mode. #61529
- A potential deadlock during initialization of
ExceptionStackContext. #63776 - Crashes in Parquet numeric conversion for ARM architecture machines. #63294
- An issue caused by the aggregate intermediate type uses
ARRAY<NULL_TYPE>. #63371 - Stability issue caused by incorrect overflow detection when casting LARGEINT to DECIMAL128 at sign-edge cases (for example, INT128_MIN) #63559
- LZ4 compression and decompression errors cannot be perceived. #63629
ClassCastExceptionwhen querying tables partitioned byFROM_UNIXTIMEon INT-type columns. #63684- Tablets cannot be repaired after a balance-triggered migration when the only valid source replica is marked
DECOMMISSION. #62942 - Profiles lost SQL statements and Planner Trace when the PREPARE statement is used. #63519
- The
extract_number,extract_bool, andextract_stringfunctions are not exception-safe. #63575 - Shutdown tablets cannot be garbage-collected properly. #63595
- Profiles showing SQL as
omitfor returns of the PREPARE/EXECUTE statements. #62988 date_truncpartition pruning with combined predicates that mistakenly produced EMPTYSET. #63464- Crashes in release builds due to the CHECK in NullableColumn. #63553
3.5.6​
Release date: September 22, 2025
Improvements​
- A decommissioned BE will be forcibly dropped when all its tablets are in the recycle bin, to avoid the decommission being blocked by those tablets. #62781
- Vacuum metrics will be updated when Vacuum succeeds. #62540
- Added thread pool metrics to the fragment instance execution state report, including active threads, queue count, and running threads. #63067
- Supports S3 path-style access in shared-data clusters to improve compatibility with MinIO and other S3-compatible storage systems. You can enable this feature by setting
aws.s3.enable_path_style_accesstotruewhen creating a storage volume. #62591 - Supports resetting the starting point of the AUTO_INCREMENT value via
ALTER TABLE`` <table_name>`` AUTO_INCREMENT`` = 10000;. #62767 - Supports using Distinguished Name (DN) in Group Provider for group matching, improving the user group solution for LDAP/Microsoft Active Directory environments. #62711
- Supports Azure Workload Identity authentication for Azure Data Lake Storage Gen2. #62754
- Added transaction error messages to the
information_schema.``loadsview to aid failure diagnosis. #61364 - Supports reusing common expressions for complex CASE WHEN expressions in Scan predicates to reduce repetitive computation. #62779
- Uses the REFRESH (instead of ALTER) privilege on the materialized view to execute REFRESH statements. #62636
- Disabled low-cardinality optimization on Lake tables by default to avoid potential issues. #62586
- Enabled tablet balancing between workers by default in shared-data clusters. #62661
- Supports reusing expressions in outer-join WHERE predicates to reduce repetitive computation. #62139
- Added Clone metrics in FE. #62421
- Added Clone metrics in BE. #62479
- Added an FE configuration item
enable_statistic_cache_refresh_after_writeto disable statistics-cache lazy refresh by default. #62518 - Masked credential information in SUBMIT TASK for better security. #62311
json_extractin the Trino dialect returns a JSON type. #59718- Supports ARRAY type in
null_or_empty. #62207 - Adjusted the size limit for the Iceberg manifest cache. #61966
- Added a remote file-cache limit for Hive. #62288
Bug Fixes​
The following issues have been fixed:
- Secondary replicas hang indefinitely due to negative timeout values, which cause incorrect timestamp comparisons. #62805
- PublishTask may be blocked when TransactionState is REPLICATION. #61664
- Incorrect repair mechanism for Hive tables that have been dropped and recreated during materialized view refresh. #63072
- Incorrect execution plans were generated after the materialized view aggregation push‑down rewrite. #63060
- ANALYZE PROFILE failures caused by PlanTuningGuide producing unrecognized strings (null explainString) in the query profiles. #63024
- Inappropriate return type of
hour_from_unixtimeand incorrect rewrite rule ofCAST. #63006 - NPE in Iceberg manifest cache under data races. #63043
- Shared-data clusters lack support for colocation in materialized views. #62941
- Iceberg table Scan Exception during Scan Range deployment. #62994
- Incorrect execution plans were generated for view-based rewrite. #62918
- Errors and disrupted tasks due to Compute Nodes are not gracefully shut down on exit. #62916
- NPE when Stream Load execution status updates. #62921
- An issue with statistics when the column name and the name in the PARTITION BY clause differ in case. #62953
- Wrong results are returned when the
LEASTfunction is used as a predicate. #62826 - Invalid ProjectOperator above the table-pruning frontier CTEConsumer. #62914
- Redundant replica handling after Clone. #62542
- Failed to collect Stream Load profiles. #62802
- Ineffective disk rebalancing caused by improper BE selection. #62776
- A potential NPE crash in LocalTabletsChannel when a missing
tablet_idleads to a null delta writer. #62861 - KILL ANALYZE does not take effect. #62842
- SQL syntax errors in histogram stats when MCV values contain single quotes. #62853
- Incorrect output format of metrics for Prometheus. #62742
- NPE when querying
information_schema.analyze_statusafter the database is dropped. #62796 - CVE-2025-58056. #62801
- When SHOW CREATE ROUTINE LOAD is executed, wrong results are returned because the database is considered null if not specified. #62745
- Data loss caused by incorrectly skipping CSV headers in
files(). #62719 - NPE when replaying batch-transaction upserts. #62715
- Publish being incorrectly reported as successful during graceful shutdown in shared-nothing clusters. #62417
- Crash in asynchronous delta writer due to a null pointer. #62626
- Materialized view refresh is skipped because the materialized view version map is not cleared after a failed restore job. #62634
- Issues caused by case-sensitive partition column validation in the materialized view analyzer. #62598
- Duplicate IDs for statements with syntax errors. #62258
- StatisticsExecutor status is overridden due to redundant state assignment in CancelableAnalyzeTask. #62538
- Incorrect error messages produced by statistics collection. #62533
- Premature throttling caused by insufficient default maximum connections for external users. #62523
- A potential NPE in materialized view backup and restore operations. #62514
- Incorrect
http_workers_nummetric. #62457 - The runtime filter fails to locate the corresponding execution group during construction. #62465
- Tedious results on Scan Node caused by simplifying CASE WHEN with complex functions. #62505
gmtimeis not thread-safe. #60483- An issue with getting Hive partitions with escaped strings. #59032
3.5.5​
Release date: September 5, 2025
Improvements​
- Added a new system variable
enable_drop_table_check_mv_dependency(default:false). When set totrue, if the object to be dropped is referenced by a downstream materialized view, the system prevents the execution ofDROP TABLE/DROP VIEW/DROP MATERIALIZED VIEW. The error message lists the dependent materialized views and suggests checking thesys.object_dependenciesview for details. #61584 - Logs now include the Linux distribution and CPU architecture of the build, to facilitate issue reproduction and troubleshooting. Log format:
... build <hash> distro <id> arch <arch>. #62017 - Persisted per-Tablet index and incremental column group file sizes are now cached, replacing on-demand directory scans. This accelerates Tablet status reporting in BE and reduces latency under high I/O scenarios. #61901
- Downgraded several high-frequency INFO logs in FE and BE to VLOG, and aggregated task submission logs, significantly reducing redundant storage-related logs and log volume under heavy load. #62121
- Improved query performance for External Catalog metadata through
information_schemaby pushing table filters before callinggetTable, avoiding per-table RPCs. #62404
Bug Fixes​
The following issues have been fixed:
- NullPointerException when fetching partition-level column statistics during the Plan stage due to missing data. #61935
- Fixed Parquet write issues with non-empty NULL arrays, and corrected
SPLIT(NULL, …)behavior to consistently return NULL, preventing data corruption and runtime errors. #61999 - Failure when creating materialized views using
CASE WHENexpressions due to incompatible VARCHAR type returns (fixed by ensuring consistency before and after refresh, and introducing a new FE configurationtransform_type_prefer_string_for_varcharto prefer STRING and avoid length mismatch). #61996 - Statistics for nested CTEs could not be computed outside of memo when
enable_rbo_table_prunewasfalse. #62070 - In Audit Logs, inaccurate Scan Rows results for INSERT INTO SELECT statements. #61381
- ExceptionInInitializerError/NullPointerException during initialization caused FE startup failure when Query Queue v2 was enabled. #62161
- BE crash when
LakePersistentIndexinitialization failed and_memtablecleanup was triggered. #62279 - Permission issues during materialized view refresh due to creator roles not being activated (fixed by adding FE configuration
mv_use_creator_based_authorization. When set tofalse, materialized views are refreshed as root, for compatibility with LDAP-authenticated clusters). #62396 - Materialized view refresh failures caused by case-sensitive List partition table names (fixed by enforcing case-insensitive uniqueness checks on partition names, aligning with OLAP table semantics). #62389
3.5.4​
Release Date: August 22, 2025
Improvements​
- Added logs to clarify the reason that tablets cannot be repaired.  #61959
- Optimized DROP PARTITION information in logs.  #61787
- Assigned a large but configurable row count to tables with unknown stats for statistical estimation.  #61332
- Added balance statistic according to label location.  #61905
- Added colocate group balance statistics to improve cluster monitoring. #61736
- Skipped the Publish waiting phase when the number of healthy replicas exceeds the default replica count. #61820
- Included the tablet information collection time in the tablet report. #61643
- Supports writing Starlet files with tags.  #61605
- Supports viewing cluster balance statistics via SHOW PROC.  #61578
- Bumped librdkafka to 2.11.0 to support Kafka 4.0 and removed deprecated configurations.  #61698
- Added
prepared_timeoutconfiguration to Stream Load Transaction Interface.  #61539 - Upgraded StarOS to v3.5‑rc3.  #61685
Bug Fixes​
The following issues have been fixed:
- Incorrect Dict version of random distribution tables. #61933
- Incorrect query context in context conditions. #61929
- Publish failures caused by synchronous Publish for shadow tablets during ALTER operations. #61887
- CVE‑2025‑55163 issue.  #62041
- Memory leak in real-time data ingestion from Apache Kafka.  #61698
- Incorrect count of rebuild files in the lake persistent index. #61859
- Statistics collection on generated expression columns causes cross-database query errors. #61829
- Query Cache misaligns in shared-nothing clusters, causing inconsistent results. #61783
- High memory usage in CatalogRecycleBin due to retaining deleted partition information.#61582
- SQL Server JDBC connections fail when the timeout exceeds 65,535 milliseconds. #61719
- Security Integration fails to encrypt passwords, exposing sensitive information. #60666
MIN()andMAX()functions on Iceberg partition columns return NULL unexpectedly.  #61858- Other predicates of Join containing non‑push‑down subfields were incorrectly rewritten.  #61868
- QueryContext cancellation can lead to a use‑after‑free situation.  #61897
- CBO’s table pruning overlooks other predicates.  #61881
- Partial Updates in
COLUMN_UPSERT_MODEmay overwrite auto-increment columns with zero.  #61341 - JDBC TIME type conversion uses an incorrect timezone offset that leads to wrong time values. #61783
max_filter_ratiowas not being serialized in Routine Load jobs. #61755- Precision loss in the
now(precision)function in Stream Load. #61721 - Cancelling a query may result in a “query id not found” error. #61667
- LDAP authentication may miss PartialResultException, causing incomplete query results. #60667
- Paimon Timestamp timezone conversion issue when the query condition contains DATETIME. #60473
3.5.3​
Release Date: August 11, 2025
Improvements​
- Lake Compaction adds Segment write time statistics. #60891
- Disable inline mode for Data Cache writes to avoid performance degradation. #60530
- Iceberg metadata scan supports shared file I/O. #61012
- Support termination of all PENDING ANALYZE tasks. #61118
- Force reuse when there are too many CTE nodes to avoid excessive optimization time. #60983
- Added
BALANCEtype to cluster balance results. #61081 - Optimized materialized view rewrite for external tables. #61037
- Default value of system variable
enable_materialized_view_agg_pushdown_rewriteis changed totrue, enabling aggregation pushdown for materialized view queries by default. #60976 - Optimized partition statistics lock competition. #61041
Bug Fixes​
The following issues have been fixed:
- Inconsistent Chunk column size after column pruning. #61271
- Synchronous execution of partition statistics loading may cause deadlocks. #61300
- Crash when
array_mapprocesses constant array columns. #61309 - Setting an auto-increment column to NULL results in the system mistakenly rejecting valid data within the same Chunk. #61255
- The actual number of JDBC connections may exceed the
jdbc_connection_pool_sizelimit. #61038 - FQDN mode did not use IP addresses as cache map keys. #61203
- Array column cloning error during array comparison. #61036
- Deploying serialized thread pool blockage led to query performance degradation. #61150
- OK hbResponse not synchronized after heartbeat retry counter reset. #61249
- Incorrect result for the
hour_from_unixtimefunction. #61206 - Conflicts between ALTER TABLE jobs and partition creation. #60890
- Cache does not take effect after upgrading from v3.3 to v3.4 or later. #60973
- Vector index metric
hit_countis not set. #61102 - Stream Load transactions fail to find the coordinator node. #60154
- BE crashes when loading OOM partitions. #60778
- INSERT OVERWRITE failed on manually created partitions. #60750
- Partition creation failed when partition names matched case-insensitively but had different values. #60909
- The system does not support PostgreSQL UUID type. #61021
- Case sensitivity issue with column names when loading Parquet data via
FILES(). #61059
3.5.2​
Release Date: July 18, 2025
Improvements​
- Collected NDV (number of distinct values) statistics for ARRAY columns to improve query plan accuracy. #60623
- Disabled replica balancing for Colocate tables and tablet scheduling in Shared-data clusters to reduce unnecessary log output. #60737
- Optimized Catalog access workflow: FE now delays accessing external data sources asynchronously at startup to prevent hanging due to external service unavailability. #60614
- Added session variable
enable_predicate_expr_reuseto control predicate pushdown. #60603 - Supports a retry mechanism when fetching Kafka partition information fails. #60513
- Removed the restriction requiring exact mapping of partition columns between materialized views and base tables. #60565
- Supports building Runtime In-Filters to enhance aggregation performance by filtering data during aggregation. #59288
Bug Fixes​
Fixed the following issues:
- COUNT DISTINCT queries crash due to low-cardinality optimization for multiple columns. #60664
- Incorrect matching of global UDFs when multiple functions share the same name. #60550
- Null pointer exception (NPE) issue during Stream Load import. #60755
- Null pointer exception (NPE) issue when starting FE during a recovery from a cluster snapshot. #60604
- BE crash caused by column mode mismatch when processing short-circuit queries with out-of-order values. #60466
- Session variables set via PROPERTIES in SUBMIT TASK statements did not take effect. #60584
- Incorrect results for
SELECT min/maxqueries under specific conditions. #60601 - Incorrect bucket pruning when the left side of a predicate is a function, leading to incorrect query results. #60467
- Crash for queries against a non-existent
query_idvia Arrow Flight SQL. #60497
Behavior Changes​
- The default value of
lake_compaction_allow_partial_successis set totrue. Compaction operations can now be marked as successful even if partially completed, preventing blockage of subsequent compaction tasks. #60643
3.5.1​
Release Date: July 1, 2025
New Features​
- [Experimental] Starting from v3.5.1, StarRocks introduces a high-performance data transfer channel based on the Apache Arrow Flight SQL protocol, comprehensively optimizing the data import channel and significantly improving transfer efficiency. This solution establishes a fully columnar data transfer pipeline from the StarRocks columnar execution engine to the client, eliminating the frequent row-column conversions and serialization overhead typically seen in traditional JDBC and ODBC interfaces, and achieving true zero-copy, low-latency, and high-throughput data transfer capabilities. #57956
- Java Scalar UDFs (user-defined functions) now support ARRAY and MAP types as input parameters. #55356
- Cross-node data cache sharing: Enables nodes to share cached external table data of data lakes across compute nodes via the network. If a local cache miss occurs, the system first attempts to fetch data from the caches of other nodes within the same cluster. Only if all caches miss will it re-fetch data from remote storage. This feature effectively reduces performance jitter caused by cache invalidation during elastic scaling and ensures stable query performance. A new FE configuration parameter
enable_trace_historical_nodecontrols this behavior (Default:false). #57083 - Storage Volume adds native support for Google Cloud Storage (GCS): You can now use GCS as a backend storage volume and manage and access GCS resources through the native SDK. #58815
Improvements​
- Optimized error messages when creating Hive external tables fails. #60076
- Optimized
count(1)query performance using thefile_record_countin Iceberg metadata. #60022 - Refined the Compaction scheduling logic to avoid delayed scheduling when all subtasks succeed. #59998
- Added
JAVA_OPTS="--add-opens=java.base/java.util=ALL-UNNAMED"to BE and CN after upgrading to JDK 17. #59947 - Supports modifying the
kafka_broker_listproperty via the ALTER ROUTINE LOAD command when Kafka Broker endpoints change. #59787 - Supports reducing build dependencies of the Docker base image through parameters. #59772
- Supports accessing Azure using Managed Identity authentication. #59657
- Improved error messages when querying external data via
Files()function with duplicate path column names. #59597 - Optimized LIMIT pushdown logic. #59265
Bug Fixes​
Fixed the following issues:
- Partition pruning issue when queries include Max and Min aggregations and empty partitions. #60162
- Incorrect query results when rewriting queries with materialized views due to missing NULL partitions. #60087
- Refresh errors on Iceberg external tables when using partition expressions based on
str2date. #60089 - Incorrect partition range when creating temporary partitions using the START END syntax. #60014
- Incorrect display of Routine Load metrics on non-leader FE nodes. #59985
- BE/CN crashes when executing queries containing
COUNT(*)window functions. #60003 - Stream Load failures when the target table name contains Chinese characters. #59722
- Overall loading failures to triple-replica tables when loading to a secondary replica fails. #59762
- Missing parameters in SHOW CREATE VIEW output. #59714
Behavior Changes​
- Some FE metrics include the
is_leaderlabel. #59883
3.5.0​
Release Date: June 13, 2025
Shared-data Enhancement​
- Shared-data clusters support generated columns. #53526
- Cloud-native Primary Key tables in shared-data clusters support rebuilding specific indexes. The performance of the indexes is also optimized. #53971 #54178
- Optimized the execution logic of large-scale data loading operations to avoid generating too many small files in Rowset due to memory limitations. During the import, the system will merge the temporary data blocks to reduce the generation of small files, which improves the query performance after the import and also reduces the subsequent Compaction operations to improve the system resource utilization. #53954
Data Lake Analytics​
- [Beta] Supports creating Iceberg views in the Iceberg Catalog with Hive Metastore integration. And supports adding or modifying the dialect of the Iceberg view using the ALTER VIEW statement for better syntax compatibility with external systems. #56120
- Supports nested namespace for Iceberg REST Catalog. #58016
- Supports using
IcebergAwsClientFactoryto create AWS clients in Iceberg REST Catalog to offer vended credentials. #58296 - Parquet Reader supports filtering data with Bloom Filter. #56445
- Supports automatically creating global dictionaries for low-cardinality columns in Parquet-formatted Hive/Iceberg tables during queries. #55167
Performance Improvement and Query Optimization​
- Statistics optimization:
- Supports Table Sample. Improved statistics accuracy and query performance by sampling data blocks in physical files. #52787
- Supports recording the predicate columns in queries for targeted statistics collection. #53204
- Supports partition-level cardinality estimation. The system reuses the system-defined view
_statistics_.column_statisticsto record the NDV of each partition. #51513 - Supports multi-column Joint NDV collection to optimize the query plan generated by CBO in the scenario where columns correlate with each other. #56481 #56715 #56766 #56836
- Supports using histograms to estimate the Join node cardinality and in_predicate selectivity, thus improving the estimation accuracy in data skew. #57874 #57639
- Optimized Query Feedback. Queries with the identical structure but different parameter values will be categorized as the same type and share the same tuning guide for plan execution optimization. #58306
- Supports Runtime Bitset Filter as an alternative for optimization to Bloom Filter in specific scenarios. #57157
- Supports pushing down Join Runtime Filter to the storage layer. #55124
- Supports Pipeline Event Scheduler. #54259
Partition Management​
- Supports using ALTER TABLE to merge expression partitions based on time functions for optimized storage efficiency and query performance. #56840
- Supports partition Time-to-live (TTL) for List-partitioned tables and materialized views. And supports the property
partition_retention_conditionin tables and materialized views to allow users to set data retention strategies for list partitions, thus achieving more flexible partition deletion strategies. #53117 - Supports using ALTER TABLE to delete partitions specified by common partition expressions, allowing users to flexibly delete partitions in batches. #53118
Cluster Management​
- Upgraded FE compile target from Java 11 to Java 17 for better system stability and performance. #53617 #57030
Security and Authentication​
- Supports secure connections encrypted by SSL based on the MySQL protocol. #54877
- Enhanced authentication using external systems:
- Supports creating StarRocks users with OAuth 2.0 and JSON Web Token (JWT).
- Supports Security Integration to simplify the authentication process with external systems. Security Integration supports LDAP, OAuth 2.0, and JWT. #55846
- Supports Group Provider to obtain the user group information from external authentication services. The group information can then be used in authentication and authorization. Group Provider supports acquiring group information from LDAP, operating systems, or files. Users can query the user group they belong to using the function
current_group(). #56670
Materialized Views​
- Supports creating materialized views with multiple partition columns to allow users to partition the data with a more flexible strategy. #52576
- Supports setting
query_rewrite_consistencytoforce_mvto force the system to use the materialized view for query rewrite, thus keeping performance stability at the cost of data timeliness to a certain extent. #53819
Loading and Unloading​
- Supports pausing Routine Load jobs on JSON parse errors by setting the property
pause_on_json_parse_errortotrue. #56062 - [Beta] Supports transactions with multiple SQL statements (currently, only INSERT is supported). Users can start, apply, or undo a transaction to guarantee the ACID (atomicity, consistency, isolation, and durability) properties of multiple loading operations. #53978
Functions​
- Introduced the system variable
lower_upper_support_utf8on the session and global level, enhancing the support for UTF-8 strings (especially non-ASCII characters) in case conversion functions such asupper()andlower(). #56192 - Added new functions: