2025.08.25 Lakehouse Studio Release Notes

In this release (Release 2025.08.25, v1.9.1), we have introduced a series of new features, enhancements, and fixes. Please note that these updates will be rolled out gradually to the following regions over one to two weeks starting from the release date, depending on your region:

  • Tencent Cloud Shanghai
  • Tencent Cloud Beijing
  • Tencent Cloud Guangzhou
  • Alibaba Cloud Shanghai
  • Alibaba Cloud Singapore
  • AWS Singapore
  • AWS Beijing

Incompatible Changes

  • Monitoring & Alerting: Adjusted the scope of the monitoring event "Task Instance Run Failure". Instances that are blocked due to upstream instance suspension or terminated and marked as failed due to timeout will no longer trigger instance run failure alerts. A new separate alert "Scheduled Instance Wait Timeout Failure" has been added, which requires separate configuration.

New Features

  • Product-overall:

    • Enhanced the onboarding experience for new users, including providing new user tutorial videos to help users understand product features. Restructured information layout on the instance homepage and strengthened quick access guidance for commonly used functions.
    • Provided a global fast-track channel for issue feedback.
  • Account System:

    • Added invitation code functionality (optional), supporting entry of invitation codes for specific promotional activities to receive activity rewards.
    • SSO single sign-on authentication now supports OAuth 2.0 protocol integration, currently available in grayscale rollout (contact Clickzetta sales to apply). Currently compatible with identity services supporting OAuth 2.0, OpenID, and SAML 2.0 protocols. After enabling single sign-on, web access only allows SSO authentication; JDBC and SDK still require username + password authentication.
    • Administrators can now modify phone/email information on behalf of other users.
  • Workspace:

    • Added batch submit and batch modify scheduling configuration capabilities, improving code management and development efficiency for developers in development scenarios.
  • Data Sync:

    • Multi-table Real-time Sync: Added support for full database mirror sync mode, which can automatically detect and adapt to new tables added under the source database for synchronization.
    • Multi-table Real-time Tasks: In mirror sync mode, supports detection of source-side table truncation behavior and corresponding cleanup of data in target tables.
    • Multi-table Real-time Sync: Failover information in the task details page in Operations Center is now persistently saved and can be viewed after task restarts.
    • Batch Sync: PostgreSQL target now supports preprocessing operations (Pre-SQL) for clearing source data in advance.
    • Batch Sync: SLS source now supports data filtering using specified query conditions.
  • Task Development:

    • Added run preference settings capability, providing two user preference options: global run or single code block run, improving the user experience in development scenarios.
    • Built-in parameters and function expressions now support multiple offsets. Based on date expressions/timestamp functions combined with multiple offsets, you can express advanced parameter values such as the day before last year, or a timestamp from one hour before last week. See: Task Parameters
  • Task Scheduling:

    • Enhanced advanced scheduling capabilities, including delayed run dry-run skip, optimized instance generation strategy, support for setting exclusion times, and new dependency strategies such as forward dependency and forward nearest dependency. Adapts to personalized scheduling execution strategy configuration needs in different business scenarios. See: Task Development and Scheduling
  • Operations Center: Added task file version information to the scheduled instance list, facilitating version identification for instances and rollback operations.

  • Data Upload: Added support for uploading local files in JSON/JSONL format.

Optimizations

  • Product-wide: Adjusted scheduling cycle display information content according to new rules for scheduling configuration time attributes, with scope covering workspace, task group list, Operations Center, etc.

  • Data Sources:

    • Provided targeted input field placeholder hints for different data source creation to assist users in entering accurate information and reduce understanding costs.
    • Optimized parameters in MySQL connection strings, providing default parameter information after the connection string, such as CONVERT_TO_NULL, useSSL, etc.
  • Data Sync:

    • ElasticSearch source: Supports adaptation to dynamically appearing keys in JSON. When keys do not exist, compatible handling is performed to avoid task failures.
    • Optimized user experience in integration task configuration. When switching tables, the data integration task cache is not refreshed.
  • Task Scheduling: Supports maintaining continuous instance generation when task scheduling start time is not on the hour. For example, if scheduling starts at 2:20 with 10-minute intervals, instances at 3:00, 3:10, 4:00, 4:10, etc. will also be generated (previously they would not be generated).

  • Operations Center:

    • Optimized the coverage scope of pause and resume task operations on instances. When pausing tasks, the scope of paused instances changed from instances on the day of operation (previous approach) to instances after the operation time. When resuming tasks, by default only task instances after the operation time are resumed.
    • Added Tooltip display of specific sub-statuses for instance states, allowing quick viewing of detailed status classifications, such as dry-run marked as success.
    • Added display of task scheduling timing information in the task instance details page.
    • Merged the last run time and status display columns in the scheduled task list to improve visual reading efficiency.
    • In multi-table real-time sync operations, execution logs now display more error information to facilitate troubleshooting and problem localization.
  • Monitoring & Alerting:

    • Instances blocked due to upstream instance suspension or killed due to timeout will not trigger instance run failure alerts.
    • Added display of instance scheduled timing in monitoring alert information for scheduled task instances to help identify the task corresponding to the alert.
  • Job History: Optimized user experience on the job history details page, including optimized cluster CRU display page and handling of invalid URL redirects.

Bug Fixes

  • Data Sync:

    • Fixed issue where tinyint(1) values synced in real-time from MySQL were inconsistent with the source table.
    • Fixed issue where multi-table real-time tasks started from a specific position would trigger underlying table rebuilds in special cases. Adjusted to not perform rebuilds.
  • Cluster Management: Fixed issue where VC page details could not be displayed when jobs were queued.

  • Monitoring & Alerting: Fixed issue with suspected false triggers of task run delay alerts.

Known Limitations

  • There are no known major limitations introduced in this release.