Image Selection

No selection loaded

Choose a selection from the dropdown above to work with specific images

AI Tools Overview

Use these AI-powered tools to enhance your image metadata. Each tool processes only the images in the currently loaded selection, so choose a selection first.

Image Cropping Tool

The foundation of your workflow - crop images to extract text and prepare them for AI processing

ESSENTIAL

What it does:

  • Interactive cropping interface for precise text extraction
  • Creates cropped versions for AI text extraction
  • Essential first step for all AI processing

Workflow:

1. Crop images to extract text areas
2. Use AI tools below to process cropped images
3. Generate metadata and descriptions

Distribution Files Check

Verify the existence of EPS, PNG, and SVG files in your distribution folders

FILE SYSTEM

What it does:

  • Scans the EPS, PNG, and SVG folders under 04_Distribution
  • Updates EPS, PNG, SVG ready flags in database
  • No AI required - pure file system check

File Structure:

📁 Parent Directory
  📁 01_Raw_Scans (source images)
  📁 04_Distribution
    📁 EPS, PNG, SVG folders
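
The check described above can be sketched as a plain file-system scan. This is a minimal illustration, not the tool's actual implementation: the function name `check_distribution_files`, the matching of files by filename stem, and the `eps_ready` / `svg_ready` flag names (modeled on the `png_ready` flag used elsewhere on this page) are assumptions.

```python
from pathlib import Path

# Folder name under 04_Distribution -> file extension expected inside it.
FORMATS = {"EPS": ".eps", "PNG": ".png", "SVG": ".svg"}


def check_distribution_files(parent_dir, stem):
    """Return ready flags for one image, identified by its filename stem.

    Hypothetical helper: assumes each format folder holds a file named
    <stem><ext>; the real tool may match files differently.
    """
    dist = Path(parent_dir) / "04_Distribution"
    flags = {}
    for folder, ext in FORMATS.items():
        candidate = dist / folder / f"{stem}{ext}"
        # A flag is True only when the file actually exists on disk.
        flags[f"{folder.lower()}_ready"] = candidate.is_file()
    return flags
```

The returned dictionary could then be written to the matching database flags for that image.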

PNG to JPG Conversion

Convert PNG files to JPG format for images that have PNG files ready

CONVERSION

What it does:

  • Processes images with png_ready=True in database
  • Converts PNG files to high-quality JPG format with 100px white padding
  • Saves JPG files in 04_Distribution/JPG folder
  • Updates jpg_ready flag in database

File Structure:

📁 04_Distribution
  📁 PNG (source files)
  📁 JPG (output files)
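
The padding and output location described above can be sketched with standard-library code alone. This is an illustrative sketch under stated assumptions: the helper names `padded_canvas` and `jpg_output_path` are hypothetical, the padding is assumed to apply equally on all four sides, and the actual pixel work (which would need an imaging library) is left out.

```python
from pathlib import Path


def padded_canvas(width, height, padding=100):
    """Return (canvas_size, paste_offset) for an image with uniform padding.

    padding=100 mirrors the 100px white padding mentioned above; applying
    it on all four sides is an assumption.
    """
    if width <= 0 or height <= 0 or padding < 0:
        raise ValueError("width/height must be positive, padding non-negative")
    canvas_size = (width + 2 * padding, height + 2 * padding)
    paste_offset = (padding, padding)
    return canvas_size, paste_offset


def jpg_output_path(png_path):
    """Map .../04_Distribution/PNG/name.png to .../04_Distribution/JPG/name.jpg."""
    png_path = Path(png_path)
    return png_path.parent.parent / "JPG" / (png_path.stem + ".jpg")
```

With an imaging library, a white canvas would be created at `canvas_size`, the PNG pasted at `paste_offset`, and the result saved to the returned path. Any transparency has to be flattened onto white first, because JPG has no alpha channel.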

Image to PDF Conversion

Convert images to PDF format - one image per PDF with professional layout

PDF

What it does:

  • Processes images with png_ready=True or jpg_ready=True
  • Creates one PDF per image with A4 page size
  • Centers images on page with professional margins
  • Saves PDF files in 04_Distribution/PDF folder
  • Updates pdf_ready flag in database

File Structure:

📁 04_Distribution
  📁 PNG/JPG (source files)
  📁 PDF (output files)
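
Centering an image on an A4 page comes down to a small amount of arithmetic, sketched below. The function name `place_on_a4`, the half-inch default margin, and the choice to shrink but never enlarge the image are assumptions for illustration; the tool's real margins and scaling rules are not documented here.

```python
A4_WIDTH_PT = 595.28   # 210 mm at 72 points per inch
A4_HEIGHT_PT = 841.89  # 297 mm at 72 points per inch


def place_on_a4(img_w, img_h, margin=36.0):
    """Return (x, y, draw_w, draw_h) in points for a centered image.

    The image is scaled down (never up) to fit inside the margins while
    keeping its aspect ratio. margin=36.0 (half an inch) is an assumed value,
    and image dimensions are treated as points.
    """
    if img_w <= 0 or img_h <= 0:
        raise ValueError("image dimensions must be positive")
    max_w = A4_WIDTH_PT - 2 * margin
    max_h = A4_HEIGHT_PT - 2 * margin
    # The tighter of the two limits decides the scale; 1.0 prevents upscaling.
    scale = min(max_w / img_w, max_h / img_h, 1.0)
    draw_w = img_w * scale
    draw_h = img_h * scale
    # Equal leftover space on both sides centers the image.
    x = (A4_WIDTH_PT - draw_w) / 2
    y = (A4_HEIGHT_PT - draw_h) / 2
    return x, y, draw_w, draw_h
```

The returned values are what a PDF library would need to draw the image: the lower-left corner and the drawn size, all in points.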

AI Service Configuration

Before using the AI tools, test your AI service configuration to confirm it works. Note: the AI tools require either a local LLM (LM Studio) or the OpenAI API to be configured and running.

Important: Text Extraction & Language Detection Requirements

The Text Extraction and Language Detection tools below each have their own prerequisite. Workflow: crop your images first, then extract text, then detect the language of the extracted text.

  • Text Extraction: Images must have cropped versions available (created using the Image Cropping Tool)
  • Language Detection: Images must have caption text already extracted (use Text Extraction first)
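
The two prerequisites above can be expressed as simple eligibility checks. This is a sketch only: the `ImageRecord` class and its field names are hypothetical stand-ins for whatever the database actually stores.

```python
from dataclasses import dataclass


@dataclass
class ImageRecord:
    # Hypothetical fields; the real database schema may differ.
    has_cropped_version: bool = False
    caption: str = ""


def can_extract_text(image):
    """Text Extraction needs a cropped version of the image."""
    return image.has_cropped_version


def can_detect_language(image):
    """Language Detection needs non-empty caption text."""
    return bool(image.caption.strip())
```

An image that has been cropped but not yet run through Text Extraction passes the first check and fails the second, which is why the tools must be run in order.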

Text Extraction

OCR

Extract text content from cropped images using AI-powered OCR. This tool processes images that have been cropped and saves the extracted text to the caption field.

Processes: Images with cropped versions
Output: Caption text field

Language Detection

NLP

Automatically detect the language of caption text extracted from images. This helps categorize images by language for better organization.

Processes: Images with caption text
Output: Caption language field

Description Generation

AI

Generate detailed descriptions of images using AI vision analysis. Creates comprehensive descriptions based on visual content.

Processes: All images in current filter
Output: Description field

JPG Description Generation

AI

Generate specialized descriptions for JPG files created by PNG conversion. Optimized for black-and-white vector illustrations, with a focus on style and composition.

Processes: Images with JPG files ready
Output: Description JPG field

Short JPG Description Generation

AI

Generate short descriptions for JPG files that already have long descriptions. Creates two-sentence descriptions that describe the subject directly, without introductory phrases.

Processes: Images with JPG ready AND long JPG descriptions
Output: Short Description JPG field
Won't overwrite existing short descriptions

AI Tags Generation

Tags

Generate relevant tags for images using AI analysis. Creates searchable tags based on visual content and context.

Processes: Images in current filter that have caption text
Output: AI tags field

Requirement: This tool will only process images that have existing caption text (extracted from previous OCR processing).

English Text Generation From Caption

Translation

Translate captions from other languages to English. Converts non-English caption text into English for better searchability.

Processes: Images with non-English captions
Output: English text field (translated from caption)

English Text from JPG Description

Generation

Generate English text from existing JPG descriptions. Creates a concise 5-word English text summary based on the JPG description field.

Processes: Images with JPG descriptions but missing English text
Output: English text field

English Text from Filename

Translation

Translate original filenames to English. Converts non-English filenames to English or copies English filenames as-is.

Processes: Images with filenames but missing English text
Output: English text field (translated from filename)

Create New Filenames

Naming

Generate web-safe snake_case filenames from English text. Creates descriptive, searchable filenames for better organization.

Processes: Images with English text but missing new_filename
Output: new_filename field (snake_case)
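
A web-safe snake_case conversion can be sketched as follows. This is one plausible approach, not the tool's confirmed logic: the function name `to_snake_case`, the ASCII-only output, and the `max_length` cap are assumptions.

```python
import re
import unicodedata


def to_snake_case(english_text, max_length=80):
    """Turn free text into a lowercase, underscore-separated ASCII name."""
    # Decompose accented characters, then drop anything outside ASCII.
    normalized = unicodedata.normalize("NFKD", english_text)
    ascii_text = normalized.encode("ascii", "ignore").decode("ascii")
    # Collapse every run of non-alphanumeric characters into one underscore.
    slug = re.sub(r"[^a-z0-9]+", "_", ascii_text.lower()).strip("_")
    # Truncate, then remove any underscore left dangling at the cut.
    return slug[:max_length].rstrip("_")
```

For example, `to_snake_case("The Old Oak Tree!")` returns `"the_old_oak_tree"`. The file extension would be appended separately.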

Metadata Processing

YAML

Process metadata.yaml files to extract book information and populate database fields. Automatically finds and parses metadata files from the parent directory of "01_Raw_Scans".

Processes: Selected images with metadata.yaml files
Output: Book metadata fields (title, author, publisher, etc.)
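
Locating the metadata file relative to "01_Raw_Scans" can be sketched with `pathlib`. The function name `find_metadata_file` is hypothetical, and the sketch assumes the image sits somewhere below the scans folder. Parsing the file would need a YAML library and is not shown.

```python
from pathlib import Path


def find_metadata_file(image_path, scans_folder="01_Raw_Scans"):
    """Return the metadata.yaml path for an image, or None if not found.

    Walks up from the image until it reaches the scans folder, then looks
    in that folder's parent directory.
    """
    path = Path(image_path)
    for ancestor in path.parents:
        if ancestor.name == scans_folder:
            candidate = ancestor.parent / "metadata.yaml"
            return candidate if candidate.is_file() else None
    # The image is not stored under a scans folder at all.
    return None
```

Returning `None` in both failure cases lets the caller skip images without metadata instead of raising an error.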

Error Checking

Validation

Check for errors and inconsistencies in AI-generated content. Validates and corrects issues in descriptions, tags, and text fields.

Processes: Images with AI fields
Output: Corrected AI fields