superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.
-
Updated
Apr 22, 2026 - TypeScript
superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.
Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.
an app engine for your business. Seamlessly implement business logic with a powerful API. Out of the box CMS, blog, forum and email functionality. Developer friendly & easily extendable for your next SaaS/XaaS project. Built with Rails 6, Devise, Sidekiq & PostgreSQL
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
[EOL] Real-Time Event Streaming & Change Data Capture
[DEPRECATED - superseded by Agnostic Data Labs] The Virtual Data Warehouse is a code generation and template management tool.
Generic interface exchange format for data automation and code generation.
The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing, which takes raw data, cleans it and stores it for later use. The extraction phase targets and retrieves the data. Transform manipulates and cleans the data. Then load stores the data, typically in a data wareh…
Data Engineering portfolio projects, resources used to study data tools...
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, audit and control data integration / ETL processes.
proof of concept to generate Airbyte low-code YAML connectors from API documentation
Apache Arrow Guide
Automatically download and transform Hetzner invoices.
Amazon Redshift Serverless RSQL ETL Framework
An ASP NET MVC 6 Web GUI (Net core) for easy reports generation using ReportGenerator
PHP ETL engine with pluggable steps: extractors, transformers, loaders
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
A comprehensive ETL pipeline and sales analysis project leveraging Microsoft Azure and PySpark, designed to optimize e-commerce sales by providing actionable insights through detailed data analysis.
Add a description, image, and links to the etl-automation topic page so that developers can more easily learn about it.
To associate your repository with the etl-automation topic, visit your repo's landing page and select "manage topics."