Titan AI LogoTitan AI

doccano

10,386
1,823
Python

Project Description

Open source annotation tool for machine learning practitioners.

doccano: Open source annotation tool for machine learning practitioners.

Project Title

doccano — Open-Source Text Annotation Tool for Machine Learning Practitioners

Overview

doccano is an open-source text annotation tool designed for machine learning practitioners, offering features for text classification, sequence labeling, and sequence to sequence tasks. It enables users to create labeled data for various applications such as sentiment analysis and named entity recognition. With its user-friendly interface and collaborative annotation capabilities, doccano stands out for its ease of use and efficiency in dataset creation.

Key Features

  • Collaborative annotation
  • Multi-language support
  • Mobile support
  • Emoji support
  • Dark theme
  • RESTful API

Use Cases

  • Machine learning practitioners use doccano to create labeled datasets for training models in natural language processing tasks.
  • Researchers in linguistics and computational linguistics use it for text classification and sequence labeling tasks.
  • Data scientists employ doccano for sentiment analysis and text summarization projects.

Advantages

  • Rapid dataset creation with a user-friendly interface.
  • Supports a wide range of annotation tasks, making it versatile for different machine learning applications.
  • Open-source, allowing for community contributions and customization.

Limitations / Considerations

  • The project's license is currently unknown, which might affect its use in commercial applications.
  • As with any annotation tool, the quality of the annotated data depends on the annotators' expertise and consistency.

Similar / Related Projects

  • Prodi.gy: A commercial platform for data annotation that offers a wide range of features but is not open-source.
  • Label Studio: An open-source annotation tool that supports various data types and machine learning models, differing in its focus on interoperability with different models.
  • Brat: A web-based tool for text annotation, known for its simplicity but lacking some of the advanced features found in doccano.

Basic Information


📊 Project Information

  • Project Name: doccano
  • GitHub URL: https://github.com/doccano/doccano
  • Programming Language: Python
  • ⭐ Stars: 10,278
  • 🍴 Forks: 1,811
  • 📅 Created: 2018-05-09
  • 🔄 Last Updated: 2025-09-23

🏷️ Project Topics

Topics: [, ", a, n, n, o, t, a, t, i, o, n, -, t, o, o, l, ", ,, , ", d, a, t, a, -, l, a, b, e, l, i, n, g, ", ,, , ", d, a, t, a, s, e, t, ", ,, , ", d, a, t, a, s, e, t, s, ", ,, , ", m, a, c, h, i, n, e, -, l, e, a, r, n, i, n, g, ", ,, , ", n, a, t, u, r, a, l, -, l, a, n, g, u, a, g, e, -, p, r, o, c, e, s, s, i, n, g, ", ,, , ", n, u, x, t, ", ,, , ", n, u, x, t, j, s, ", ,, , ", p, y, t, h, o, n, ", ,, , ", t, e, x, t, -, a, n, n, o, t, a, t, i, o, n, ", ,, , ", v, u, e, ", ,, , ", v, u, e, j, s, ", ]


🎮 Online Demos

📚 Documentation

  • [Codacy Badge
  • [AWS CloudFormation Launch Stack SVG Button
  • [Deploy
  • [GCP Cloud Run PNG Button

This article is automatically generated by AI based on GitHub project information and README content analysis

Titan AI Explorehttps://www.titanaiexplore.com/projects/doccano-132709824en-USTechnology

Project Information

Created on 5/9/2018
Updated on 11/12/2025