Graphical tool to manually label images in distinct categories to build training datasets. Simply pass a list of categories, a directory containing images and start labelling. Supports multiple users, reconciliation and keyboard bindings to label even faster!
Simplabel is on PyPI so it can be installed with pip
pip install simplabel Clone the repository to your computer
git clone https://github.com/hlgirard/Simplabel.git and install with pip
cd Simplabel pip install . Simplabel can be started from the command line without any argument:
simplabel You will be prompted to select a directory containing images to label. Add labels with the '+' button and start labeling. Number keys correspond to labels and can be used instead.
The target directory and/or labels can also be passed directly from the command line:
simplabel --labels dog cat bird --directory path/to/image/directory After the first use, labels are stored in labels.json and the --labels argument is ignored.
-d, --directory <PATH/TO/DIRECTORY>sets the directory to search for images and save labels to. Defaults to the current working directory.-l, --labels <label1 label2 label3 ...>sets the categories for the labelling task. Only passed on the first use in a given directory.-u, --user <USERNAME>sets the username. Defaults to the OS login name if none is passed.-r, --redundantdoes not display other labelers selections for independent labelling. Reconciliation and Make Master are unavailable in this mode.-v, --verboseincreases the verbosity level.--remove-label <LABEL>tries to safely remove a label from the list saved inlabels.json(must also pass-d)--reset-lockoverrides the lock preventing the same username from being used multiple times simultaneously.--delete-allremoves all files created by simplabel in the directory (must also pass-d)
The app relies on the filesystem to save each user's selection and display other user's selections. It works best if the working directory is on a shared drive or in a synced folder (Dropbox, Onedrive...). The Reconcile workflow allows any user to see and resolve conflicts. The Make Master option can be used to create and save a master dictionary - labeled_master.json - containing all labeled images (after reconciliation).
The app saves a labeled_<username>.json file that contains a jsonified dictionary {image_name: label}. To import the dictionary, use the following sample code:
import json with open("labeled_user1.json","rb") as f: label_dict = json.load(f)Once you are done labelling, use the flow_to_directory tool to copy images to distinct directories by label
flow_to_directory --input-directory data/labeled --output-directory data/sorted The Tkinter app can also be started from a python environment
from simplabel import ImageClassifier import tkinter as tk root = tk.Tk() directory = "data/raw" categories = ['dog', 'cat', 'bird'] MyApp = ImageClassifier(root, directory, categories) tk.mainloop()This project is licensed under the GPLv3 License - see the LICENSE.md file for details.
Testing of tkinter GUI is based on ivan_pozdeev's answer at Stackoverflow:
https://stackoverflow.com/questions/4083796/how-do-i-run-unittest-on-a-tkinter-app
