Overview
Live Inspect allows you to monitor and debug your pipelines by viewing live data samples as it flows through each topology node (sources, transforms, and sinks). This is invaluable for verifying data transformations, debugging issues, and understanding how your pipeline processes data in real-time.Basic Usage
Inspect live data from a running pipeline:Command Options
Pipeline name or path to YAML configuration file
Comma-separated list of topology node keys to filter. Only shows data from the specified nodes. Example:
-n source1,transform1Path to config file
Filtering Topology Nodes
By default, Live Inspect shows data from all nodes in your pipeline. Filter to specific nodes using the-n flag:
Sampling
Live Inspect uses intelligent sampling to display data from your pipeline without overwhelming the output. By default, it shows:- 15 records per topology node: A representative sample of the most recent records flowing through each node (source, transform, or sink)
- 30-second refresh interval: Data updates every 30 seconds (only when data is flowing through the topology node) to provide real-time visibility while maintaining reasonable resource usage
The sampling behavior can be tuned to suit particular use cases, such as higher-frequency updates for debugging or different sample sizes for specific data volumes. If you need to adjust the sampling behavior for your use case, please reach out for support.
Example: Inspecting a Pipeline
Consider a pipeline that filters ERC-20 transfers:polygon_erc20_transfers(source)filtered_transfers(transform)postgres_sink(sink)
Use Cases
Debugging Transform Logic
Debugging Transform Logic
Use Live Inspect to verify that your SQL transforms are filtering and transforming data correctly. Check the input and output of each transform to ensure your logic works as expected.
Verifying Data Sources
Verifying Data Sources
Quickly verify that your sources are producing data in the expected format and with the correct schema before it flows through transforms.
Troubleshooting
No Data Appearing
No Data Appearing
If you don’t see any data:
- Verify your pipeline is running:
goldsky turbo list - Check that data is flowing:
goldsky turbo logs my-pipeline - Ensure you’re using the correct pipeline name