Empower your AI journey with essential toolkits. Access resources, templates and guides to accelerate development and streamline workflows.
Disclaimer :The data preprocessing tools provided on this platform are intended for general use and are based on feedback from the broader community.
ARX
Enterprise-grade tool for anonymizing structured sensitive personal information (PII).
4
12
291
View Tool
Authentication and API Key Setup: Secure your API integrations with AIKosh
Authentication ensures only trusted users access AIKosh APIs. With API keys, each request is verified, keeping your data safe and integrations secure from unauthorized access.
4
3
156
View Tool
Fetch Dataset List: Programmatically retrieve all Datasets by Organisation
The Fetch Dataset List API helps platforms get a quick list of all datasets, useful for integration, API key checks, and managing multiple datasets easily.
The Fetch File Download URL API gives secure, short-lived links to download datasets, helping you share files safely without using static links. It keeps your data delivery secure, scalable, and trackable.
1
1
8
View Tool
Fetch Dataset Preview – Deliver Data Snapshots via API
The Fetch Dataset Preview API shows a quick sample of data, helping users check format and relevance before downloading.
0
1
12
View Tool
Fetch Dataset Metadata – Enable Seamless Dataset Discovery via API
lets contributors share machine-readable dataset summaries, making them discoverable and usable without downloads. This blog covers its purpose, structure, implementation, and best practices.
1
1
43
View Tool
DataPrep
Lightweight Python library for data cleaning, profiling, and exploratory analysis, ideal for quick data preparation tasks.
1
0
19
View Tool
DataCleaner
Minimalist, command-line tool for basic data cleaning operations on CSV files — ideal for scripting and batch use.
0
0
31
View Tool
OpenRefine
Powerful desktop tool for cleaning, transforming, and reconciling messy tabular data (like CSVs and Excel files).
1
0
19
View Tool
Doccano
Lightweight and intuitive annotation tool for NLP tasks like text classification, named entity recognition, and sequence-to-sequence labeling.
1
0
24
View Tool
Shoonya
Open-source platform for multilingual text and speech annotation, designed specifically for Indian language datasets.
0
1
25
View Tool
OCR Toolkit
OCR utility for extracting printed or handwritten text from images, scans, and PDFs