Forensics Analyzer Documentation

Version: Image Forensics v3.0.0 | Video Forensics v2.5.0

This guide helps you understand the analysis reports from ImageForensicsAnalyzer and VideoForensicsAnalyzer. Each section explains what the signatures mean and how to interpret the results.



1. Overview: How the Analyzers Work

What These Tools Do

The forensics analyzers examine files at the binary level to detect signs of AI generation, manipulation, or editing. They look for:

  • Embedded signatures - Text strings left by software tools
  • Metadata patterns - Information stored within files about how they were created
  • Structural anomalies - Technical characteristics that differ from authentic camera/recording content
  • Encoding fingerprints - Specific compression patterns associated with AI pipelines

What the Report Tells You

A forensics report provides:

  • A risk assessment indicating likelihood of AI generation
  • A list of detected signatures with confidence levels
  • Technical details about the file's encoding and metadata
  • For videos: frame-by-frame analysis of visual patterns

Important Limitations

No forensic tool is 100% accurate. Consider these factors:

  • Legitimate editing software may trigger some signatures
  • AI-generated content can be "cleaned" to remove obvious markers
  • New AI tools may not yet be in the signature database
  • Low-confidence signatures should be treated as informational only

2. Understanding Confidence Levels

Every detected signature includes a confidence rating:

LevelMeaningWhat to Do
Very HighNear-certain identification. The signature is unique to a specific AI tool or manipulation software.This is strong evidence. The file very likely came from the identified source.
HighStrong indicator. The signature is rarely found in authentic camera/recording content.Take seriously. Look for corroborating factors in the report.
MediumPossible indicator. The signature may appear in both AI and legitimate content.Consider alongside other evidence. Don't rely on this alone.
LowWeak indicator. Common in many types of files.Informational only. Not reliable for determining AI generation.

3. Risk Assessment Explained

The analyzers calculate an overall risk score based on multiple factors.

Risk Levels

LevelScore RangeInterpretation
High50-100Strong evidence of AI generation or significant manipulation. Multiple high-confidence indicators present.
Medium30-49Moderate evidence. May include AI tool signatures or suspicious encoding patterns. Warrants further investigation.
Low-Medium15-29Some indicators present, but not conclusive. Could be legitimate editing software or partial AI assistance.
Low0-14Minimal indicators. Appears to be authentic or processed with standard tools.

Factor Impacts

Each factor in the report is weighted by its significance:

  • Critical - Direct attribution to an AI tool (e.g., "Tool: Runway ML")
  • High - Strong AI indicators (signatures, suspicious encoding)
  • Medium - Supporting evidence (ML frameworks, cloud platforms)
  • Low - Minor indicators (generic patterns)
  • Info - Neutral information (C2PA present, encoder type)

4. Image Forensics Signatures

AI Generation Signatures (Strong)

These signatures are strong evidence of AI image generation. They rarely appear in photos from cameras or standard editing software.

SignatureDescriptionWhat It Means
MidjourneyText-to-image AI serviceImage was created or processed by Midjourney's AI
Stable DiffusionOpen-source AI image generatorImage was generated using Stable Diffusion models
DALL-EOpenAI's image generation AIImage was created by OpenAI's DALL-E system
NovelAIAI art generation serviceImage originated from NovelAI's generators
Automatic1111Popular Stable Diffusion interfaceGenerated using the Automatic1111 web UI
ComfyUINode-based AI generation interfaceCreated using ComfyUI workflow system
InvokeAIStable Diffusion distributionGenerated using InvokeAI software
DreamStudioStability AI's web serviceCreated using Stability AI's platform
Bing Image CreatorMicrosoft's AI image toolGenerated through Microsoft's Bing AI
Adobe FireflyAdobe's generative AICreated using Adobe's Firefly AI tools
SDXLStable Diffusion XL modelGenerated using the larger SDXL model
Flux.1 / FLUXModern diffusion modelGenerated using Flux AI models
FooocusSimplified SD interfaceCreated using Fooocus generator
IDEOGRAM / ideogram.aiText-focused AI generatorCreated using Ideogram's AI service
RecraftAI design toolGenerated using Recraft AI
Imagine with MetaMeta's AI image generatorCreated through Meta's AI tools
Google Inc. 2016ICC Profile signature (UTF-16 BE)Strong indicator of Google AI (Gemini/Imagen) origin
GeminiGoogle's multimodal AIImage processed or generated by Google Gemini
ImagenGoogle's image generation AICreated using Google's Imagen system
DiffusersHuggingFace libraryGenerated using HuggingFace Diffusers library
HuggingFaceML model repositoryCreated using models from HuggingFace
CivitaiAI model sharing platformGenerated using models from Civitai

Editing/Manipulation Signatures

These indicate the image was edited with professional software. Not necessarily AI-generated, but has been modified.

SignatureCategoryWhat It Means
Adobe PhotoshopPhoto EditingImage was edited in Photoshop
Adobe LightroomPhoto EditingImage was processed in Lightroom
GIMPPhoto EditingEdited with the free GIMP software
Affinity PhotoPhoto EditingProcessed with Affinity Photo
PixelmatorPhoto EditingEdited using Pixelmator (Mac)
Paint.NETPhoto EditingProcessed with Paint.NET
Capture OneRAW ProcessingProfessional RAW development software
SnapseedMobile EditingEdited using Google's Snapseed app
MeituBeauty AppProcessed with Meitu beautification app
FaceAppFace ModificationFace was modified using FaceApp AI

Deepfake/Face Manipulation Signatures

These are serious indicators of facial manipulation or synthetic face generation.

SignatureDescriptionSeverity
DeepFaceLabFace-swapping softwareCritical - Direct deepfake tool
FaceSwapOpen-source face-swapCritical - Explicit face manipulation
RoopOne-click face swapCritical - Face replacement detected
RefaceFace-swap appHigh - Mobile face-swap tool
SimSwapAI face swappingCritical - Advanced face manipulation
InsightFaceFace analysis libraryHigh - Often used for face swapping
GFPGANFace restoration AIMedium - Face enhancement (may be legitimate restoration)
CodeFormerFace restoration AIMedium - Advanced face enhancement
Real-ESRGANImage upscaling AILow - May be legitimate upscaling
LensaAI portrait appHigh - AI-enhanced/generated portraits
ReminiPhoto enhancement AIMedium - AI enhancement applied

Pattern-Based Detection (Regex)

The analyzer also detects AI signatures through patterns rather than exact text matches:

Pattern TypeWhat It DetectsExample Match
Midjourney FilenameUUID patterns with "_mj_" markerabc123_mj_image.png
Midjourney ParametersCommand-line style generation parameters--v 5, --ar 16:9
SD Generation InfoAutomatic1111 generation metadataSteps: 30, Sampler: Euler
CFG ScaleGuidance scale parameterCFG scale: 7.5
Model HashAI model identificationModel hash: a1b2c3d4
ComfyUI WorkflowJSON workflow structures"class_type": "KSampler"
DALL-E SignatureOpenAI generation markerGenerated by DALL·E
Prompt/Negative PromptGeneration prompts in metadataPrompt: beautiful landscape...
Seed ValueRandom seed for generationSeed: 123456789

5. Video Forensics Signatures

AI Video Generation Tools

These signatures indicate the video was generated by AI systems.

SignatureCompanyDescription
Runway ML / Gen-2 / Gen-3RunwayLeading AI video generation platform. Creates videos from text or images.
Pika LabsPikaText-to-video AI. Known for creative video generation.
OpenAI Sora / Sora1OpenAIOpenAI's advanced video generation AI.
SynthesiaSynthesiaAI avatar video platform. Creates videos of synthetic humans speaking.
HeyGenHeyGenAI avatar videos with voice cloning.
D-IDD-IDTalking head AI. Animates faces from photos.
ColossyanColossyanEnterprise AI avatar platform.
Stable Video DiffusionStability AIOpen-source video generation model.
Google Veo / Lumiere / PhenakiGoogleGoogle's video generation AI systems.
Meta Make-A-Video / Emu VideoMetaMeta's video generation research projects.
AnimateDiffOpen SourceAnimation framework for Stable Diffusion images.
ModelScope / ZeroScopeVariousOpen-source text-to-video models.
CogVideo / VideoCrafterResearchAcademic video generation models.
Luma AI / Dream MachineLumaAI video and 3D generation platform.
Kaiber AIKaiberAI video generation for music and art.
Kling AIKuaishouChinese AI video generation platform.
MiniMax / HailuoMiniMaxChinese AI video generation (Hailuo AI).
Haiper AIHaiperAI video creation platform.
PixVersePixVerseAI video generation service.
Deforum / Warp FusionOpen SourceAnimation tools for AI image sequences.

Deepfake/Face Manipulation (Video)

SignatureTypeRisk Level
DeepFaceLabFace SwapCritical - Most common deepfake tool
FaceSwap / FaceFusionFace SwapCritical - Open-source face replacement
Wav2LipLip SyncCritical - AI lip synchronization
RoopFace SwapCritical - Single-image face swap
First Order MotionAnimationHigh - Animates images from video
SimSwap / InsightFaceFace SwapCritical - Advanced face manipulation
SadTalker / MakeItTalkTalking HeadHigh - Animates portraits to speak
Audio2FaceFacial AnimationHigh - Audio-driven face animation
LivePortraitPortrait AnimationHigh - Real-time portrait animation

Cloud AI Platforms

Detection of cloud infrastructure used for AI processing:

  • Replicate - Model hosting platform (often runs AI video models)
  • FAL.ai - AI inference platform
  • Modal - Cloud compute for AI
  • Together AI - AI model hosting
  • Fireworks AI - Fast AI inference
  • HuggingFace - AI model repository and hosting
  • Gradio - AI demo interfaces (indicates AI processing)
  • RunPod / Vast.ai - GPU cloud providers

ML Framework Signatures

Presence of machine learning libraries may indicate AI processing:

  • PyTorch / TensorFlow / Keras - Deep learning frameworks
  • HuggingFace Diffusers - AI image/video generation library
  • Safetensors - AI model format
  • ComfyUI / Automatic1111 - AI generation interfaces
  • ControlNet / IP-Adapter - AI conditioning techniques
  • LoRA - AI model fine-tuning method

Binary Signatures

Some AI tools leave binary (non-text) patterns in video headers:

SignaturePatternMeaning
Kling AI SPS95 90 05 00 5b b0 11Kling AI's H.264 encoder signature in video stream parameters

6. Encoding Analysis (Video)

What Encoding Analysis Reveals

AI video generation tools typically use fast, simple encoding settings. The analyzer looks at x264/x265 encoder parameters to detect patterns associated with AI pipelines.

Suspicious Encoding Options

These encoding settings, when found together, suggest AI generation:

OptionSuspicious ValueWeightWhy It's Suspicious
scenecut04Scene detection disabled. AI generates frame-by-frame, doesn't need scene cuts.
bframes03No bidirectional frames. AI generates sequential frames without temporal prediction.
subme03No subpixel motion estimation. AI content has no real motion to estimate.
cabac02CABAC disabled for speed. AI pipelines prioritize fast encoding.
ref12Single reference frame. AI doesn't benefit from multiple references.
mbtree01Macroblock tree disabled. Not needed for AI-generated content.
trellis01Trellis optimization disabled for speed.
8x8dct018x8 discrete cosine transform disabled.
weightp01Weighted prediction disabled.
mixed_ref01Mixed references disabled.

Encoding Score Thresholds

ScoreLikelihoodInterpretation
0NoneNormal encoding parameters
1-7LowSome fast-encode options, possibly legitimate
8-11MediumSuspicious pattern of options
12-15Medium-HighLikely AI encoding pipeline
16+HighStrong evidence of AI encoding

Truncated Encoder String

When the x264 encoder string doesn't include the "options:" section, it often indicates the video passed through a cloud AI pipeline that strips this information.


7. Frame Analysis (Video)

What Frame Analysis Measures

The analyzer extracts frames from the video and measures statistical properties that differ between AI-generated and real video.

Per-Frame Metrics

MetricWhat It MeasuresAI Indicator
Color Stats (mean, std)Average color and variation per channel (R, G, B)AI videos often have unnaturally consistent colors
Edge DensityAmount of sharp edges in the frameAI may have unusual edge patterns
Noise EstimateBackground noise levelAI generates unnaturally uniform noise
Histogram EntropyInformation density (0-8 bits)AI may have unnatural distribution
Banding ScoreColor quantization artifactsAI often produces color banding

Cross-Frame Analysis

AnalysisDescriptionSuspicious Threshold
Color ConsistencyRange of color variation across frames<15 total range indicates synthetic content
Noise PatternsVariance of noise across frames<0.5 variance indicates uniform AI noise
Banding AnalysisAverage banding artifacts>20 average indicates AI color issues

Detected Anomalies

  • color_consistency - "Unusually consistent colors (synthetic)" - Colors don't vary naturally
  • uniform_noise - "Uniform noise (uncommon natural)" - Noise is too consistent across frames
  • color_banding - "Color banding (AI common)" - Visible steps in color gradients

8. C2PA Content Credentials

What is C2PA?

C2PA (Coalition for Content Provenance and Authenticity) is an industry standard for embedding verifiable information about how content was created. Major companies like Adobe, Microsoft, Google, and OpenAI are adopting this standard.

C2PA Indicators

IndicatorMeaningSignificance
c2paC2PA Content Credentials presentFile contains provenance data
jumb / JUMBFJUMBF container foundISO standard container for C2PA data
trainedAlgorithmicMediaIPTC AI-Generated flagHigh confidence: Explicitly marked as AI-generated
compositeWithTrainedAlgorithmicMediaIPTC AI-Composite flagContains AI-generated elements mixed with other content
algorithmicMediaIPTC Algorithmic flagCreated using algorithmic/AI processes
digitalSourceTypeSource type declarationDescribes origin (camera, AI, composite, etc.)
softwareAgentCreation softwareNames the tool that created the content
truepicTruepic signingContent signed by Truepic verification service

Interpreting C2PA Results

  • C2PA present + AI flag: Content creator has honestly declared AI generation
  • C2PA present, no AI flag: May be from a camera or editing software
  • C2PA absent: No provenance data (neither confirms nor denies AI)

9. Filename Pattern Detection

Why Filenames Matter

AI tools often generate files with distinctive naming patterns. Even if metadata is stripped, the filename may reveal the source.

Known Filename Patterns

PatternSourceExampleConfidence
Generated_Image_[Month]_[Day]_[Year]Google GeminiGenerated_Image_November_14__2025_-_1_15PM.pngVery High
UUID with _mj_ markerMidjourneya1b2c3d4_mj_5678.pngHigh
DALL·E or DALL-E variantsDALL-E / OpenAIDALL·E_2025_image.pngVery High
00001-1234567890-promptStable Diffusion (Auto1111)00001-123456789-beautiful_landscape.pngHigh
comfyui_[number]ComfyUIcomfyui_00001.pngVery High
leonardo_ai_[...]Leonardo.AIleonardo_ai_creative_123.pngHigh
OIG.[...] or bing_imageBing Image CreatorOIG.abc123.jpgHigh
firefly_[...] or adobe_fireflyAdobe Fireflyfirefly_generated_image.pngHigh
flux_dev or flux_schnellFluxflux_dev_output.pngMedium
nightcafe_[...]NightCafenightcafe_studio_art.pngHigh

Video-Specific Filename Patterns

PatternSourceScore Impact
kling_[...]Kling AI+40 points
runway_[...]Runway ML+40 points
pika_[...]Pika Labs+40 points
sora_[...]OpenAI Sora+40 points
synthesia_[...]Synthesia+40 points
text_to_video or txt2vidAny T2V generator+20 points
ai_generatedGeneric AI+20 points
YYYYMMDD_HHMM_ timestampAI service export+10 points
24+ character hex IDAI service export+15 points

10. Quantization Tables (Images)

What Are Quantization Tables?

JPEG images use quantization tables during compression. Different software uses different tables, creating a "fingerprint" that can identify the source application.

Known Quantization Tables

Table NameSourceSignificance
Standard_IJG_50IJG (JPEG standard) Quality 50Default JPEG library. May indicate programmatic generation.
Standard_IJG_75IJG Quality 75Common default quality setting.
Standard_IJG_90IJG Quality 90High quality setting.
Photoshop_SaveWeb_HQAdobe Photoshop "Save for Web"Indicates Photoshop processing.

Match Types

  • Exact - Table matches perfectly (very high confidence)
  • Approximate - 95%+ similarity (high confidence)
  • Similar - 85-95% similarity (medium confidence)

Why Standard Tables May Indicate AI

Real cameras typically use proprietary quantization tables. When an image uses the standard IJG tables, it often means:

  • The image was generated by software (not a camera)
  • It was re-encoded by a web service or API
  • The original metadata/tables were stripped

11. Glossary of Terms

AI Generation
Content created entirely by artificial intelligence from text prompts or other inputs.
B-frames
Bidirectional video frames that reference both past and future frames. AI-generated videos often lack these.
C2PA
Coalition for Content Provenance and Authenticity. An industry standard for content credentials.
CABAC
Context-Adaptive Binary Arithmetic Coding. Advanced compression often disabled in AI pipelines.
CFG Scale
Classifier-Free Guidance scale. A parameter controlling how closely AI follows the prompt.
ComfyUI
A node-based interface for AI image generation using Stable Diffusion.
Deepfake
AI-manipulated video, typically involving face replacement or animation.
Diffusion Model
The AI architecture used by Stable Diffusion, DALL-E 3, Midjourney, and most modern image generators.
Entropy
A measure of information density. High entropy means more random/complex data.
FFmpeg
Open-source video processing library. Often used in AI pipelines.
ftyp
The "file type" box at the start of MP4 files identifying the format version.
ICC Profile
Color management data embedded in images. Can contain identifying information.
JUMBF
JPEG Universal Metadata Box Format. The container standard for C2PA data.
LoRA
Low-Rank Adaptation. A technique for fine-tuning AI models.
Metadata
Information about the file embedded within it (creator, date, camera model, etc.).
MP4 Box
A structural unit in MP4 files. Different boxes contain different data types.
moov
The MP4 box containing movie metadata and track information.
Quantization Table
The mathematical table used during JPEG compression. Acts as a fingerprint.
Scene Cut
Video encoding feature that detects scene changes. Often disabled in AI video.
Seed
The random number used to initialize AI generation. Same seed = same output.
Signature
A text string or pattern that identifies a specific tool or process.
Stable Diffusion
Open-source AI image generation model. Basis for many AI art tools.
stts
MP4 box containing time-to-sample information (frame timing).
Text-to-Image (T2I)
AI that generates images from text descriptions.
Text-to-Video (T2V)
AI that generates videos from text descriptions.
WebCodecs
Browser API for low-level video decoding. Used for frame analysis.
x264/x265
Video encoding libraries for H.264 and H.265 formats.