Studio

Studio is both the web management console for Singdata Lakehouse and a serverless data services platform. The data sync jobs, SQL scheduling tasks, and Python scripts you configure in the UI are automatically executed by Singdata-managed serverless infrastructure — no compute resources to manage. Data engineers, analysts, and administrators can handle the full pipeline from data ingestion to data consumption in a single interface, without switching between tools.

Data Agent — AI Assistant

Studio has a built-in Data Agent — a fully AI-driven interaction layer built on top of Lakehouse, covering the full lifecycle of development, operations, and governance. You can tell the Agent what to do in natural language, and it operates the platform on your behalf — shifting from "person operates platform" to "person directs Agent."

Typical use cases:

  • ETL development assistance: Describe your data processing requirements, and the Agent generates SQL tasks and configures scheduling
  • Natural language data retrieval: Ask "What are the order amounts by region for the last 7 days?" and the Agent translates it to SQL and returns results
  • Day-to-day operations Q&A: Ask "Which tasks are currently failing?" and the Agent queries and provides remediation suggestions
  • Data insight generation: Upload data or specify a table, and the Agent automatically generates an analysis report

Six Core Modules

Data Sync
Visually configure real-time CDC sync and offline batch sync for 40+ data sources — no code required. Supports MySQL, PostgreSQL, Oracle, Kafka, and other mainstream sources.

Data Sync Overview

Task Development (IDE)
Built-in SQL / Python / Shell editors with support for composite tasks, loop tasks, and conditional branching — develop your data transformation logic end-to-end in one place.

Task Development and Scheduling

Task Scheduling
Cron-based scheduling with upstream/downstream dependency management, support for historical backfill, and real-time visibility into task execution status.

Task Scheduling Dependencies

Workspace (SQL Query)
Interactive SQL editor with multi-tab support, result visualization, and query history — ideal for ad-hoc analysis and data exploration.

Workspace

Operations & Monitoring
View task run history, logs, and failure reasons. Configure alert notifications and stay on top of your data pipeline health.

Task and Instance Operations

Data Catalog
Browse and manage table schemas, column comments, and data lineage. Supports permission requests and approval workflows.

Data Catalog


I want to sync external data in

Recommended: Data Sync module — supports 40+ data sources with visual configuration, no code required.

ScenarioApproachReference
Relational databases (MySQL / PG / Oracle, etc.), real-time syncReal-time sync task (CDC)Real-time Sync Task
Full database sync, migrating multiple tables at onceMulti-table real-time syncMulti-table Real-time Sync Guide
Offline periodic sync (T+1 or hourly)Offline sync taskOffline Sync Task
Not sure which data source to useCheck supported sourcesData Source Support

I want to develop data processing tasks

Recommended: Task Development module (IDE) — supports SQL, Python, and Shell with composable orchestration.

ScenarioApproachReference
SQL-based data transformationSQL taskTask Development and Scheduling
Python processing logicPython taskPython Task
Multiple tasks in sequence with dependenciesComposite taskComposite Task
Loop over multiple partitions or objectsFor each loop taskFor Each Loop Task
Conditional branching (if/else logic)Branch taskBranch Task
Incremental computation with auto-maintained result tablesDynamic table taskDynamic Table Task

I want to configure scheduled execution

Recommended: Task Scheduling module — supports Cron expressions and upstream/downstream dependency management.

ScenarioReference
Set up timed task executionTask Scheduling and Instance Execution
Configure dependencies between tasksTask Scheduling Dependencies
Backfill historical dataData Backfill

I want to check task run status / troubleshoot issues

Recommended: Operations & Monitoring module.

ScenarioReference
View task run history and logsTask and Instance Operations
Configure alert notificationsMonitoring and Alerting
Understand monitoring metric definitionsMonitoring Metric Specifications

I want to query and analyze data

ScenarioApproachReference
Interactive SQL queriesWorkspaceWorkspace
Visualize query resultsAnalysis (Notebook)Analysis
View and manage table schemasData CatalogData Catalog

I want to manage compute resources

ScenarioReference
Create, start, or stop a VClusterCompute Clusters
View historical job resource consumptionJob History
Understand cluster size codesCompute Cluster Size Reference

I want to manage permissions and approvals

ScenarioReference
Request data access permissionsPermission Requests
Process approval ticketsApproval Tickets

New to Studio?

Follow this sequence to get started:

  1. Lakehouse Studio Quick Tour — 5-minute overview of the interface layout
  2. Lakehouse Studio Getting Started Guide — complete your first end-to-end workflow
  3. Studio Overview — deep dive into each module's capabilities