BitSail is ByteDance's open source data integration engine which is based on distributed architecture and provides high performance. It supports data synchronization between multiple heterogeneous data sources, and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, it serves almost all business lines in ByteDance, such as Douyin, Toutiao, etc., and synchronizes hundreds of trillions of data every day. BitSail has been widely used and supports hundreds of trillions of large traffic. At the same time, it has been verified in various scenarios such as the cloud native environment of the volcano engine and the on-premises private cloud environment.

Features

  • Global Data Integration, covering batch, streaming and incremental scenarios
  • Distributed and cloud-native architecture, supporting horizontal scaling
  • High maturity in terms of accuracy, stability and performance
  • Rich basic functions, such as type conversion, dirty data processing, flow control, data lake integration, automatic parallelism calculation , etc.
  • Task running status monitoring, such as traffic, QPS, dirty data, latency, etc.
  • Mass data synchronization in heterogeneous data sources

Project Samples

Project Activity

See All Activity >

Categories

Data Pipeline

License

Apache License V2.0

Follow BitSail

BitSail Web Site

Other Useful Business Software
Train ML Models With SQL You Already Know Icon
Train ML Models With SQL You Already Know

BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of BitSail!

Additional Project Details

Programming Language

Java

Related Categories

Java Data Pipeline Tool

Registered

2023-06-12