The API provides a suite of specialized operations designed to optimize specific tasks.
The currently available operations include:
- compare-image: Compare the differences between two images, using various algorithms such as AE, MAE, NCC, PSNR, and RMSE.
- compare-video: Compare the differences between two videos.
- compress: Compress image files while maintaining a visually similar quality to the human eye.
- convert-pdfa: Convert a PDF to a PDF/A compliant format, with the option to select a specific PDF/A profile.
- validate-pdfa: Verify PDF/A compliance with a specific PDF/A profile for a given file.
- extract-archive: Extract the contents of a compressed archive.
- extract-streams: Extract individual streams (such as video, audio, and subtitles) from a multimedia file.
- merge-streams: Merge multiple streams (such as video, audio, and subtitles) into a single multimedia file, with the option to specify specific layouts such as 5.1 audio.
- speech-to-text: Convert audio to text.
- thumbnail: Create thumbnails of document files.
- ai_upscale: Use AI models to upscale an image.