Automated Image Quality Assessment (IQA) for SEM Metrology

Overview

A Python-based toolkit developed to ensure the stability and consistency of image quality from Scanning Electron Microscopes (SEM) used in high-volume semiconductor manufacturing. By automating the measurement of critical image degradation factors (focus, stigmation, noise), the project shifts quality control from subjective visual checks to objective, quantitative metrics. This directly supports accurate Critical Dimension (CD) and defect metrology, where image consistency is paramount.

Tech Stack: Python, NumPy, OpenCV, scikit-image, Matplotlib, SciPy

Environment: Jupyter Notebook / Colab for interactive analysis

Links & Artifacts

GitHub

Key Features & Technical Achievements

Automated Image Quality Assessment (IQA): Implemented a suite of both Full-Reference (e.g., Structural Similarity Index, SSIM; Peak Signal-to-Noise Ratio, PSNR) and No-Reference (e.g., Laplacian Variance, BRISQUE) metrics. This provides a comprehensive, objective score for image quality to quantify the tool health.
Domain-Specific Metric Implementation: Adapted the Normalized Variance Focus Score (Normvar), a key technique from academic literature, specifically for Gold-on-Carbon (GoC) reference images, aligning the tool with industry-standard qualification methods.
Tool-to-Tool Matching: Developed histogram comparison and Frequency-domain (FFT) analysis to statistically verify the consistency of operating parameters across different SEM machines, a critical requirement for high-reliability fabs.
Simulation of Degradations: Created functions to synthetically generate realistic SEM image flaws (Gaussian Blur for defocus, Anisotropic Blur for stigmation, and Poisson/Gaussian noise), providing a robust validation suite for the assessment algorithms.

Impact

Improved Manufacturing Yield: Reduced reliance on manual, subjective quality checks, leading to faster tool qualification and minimized downtime from unstable SEMs.
Enhanced Data Reliability: Provided quantitative proof of image quality, ensuring that all subsequent metrology and machine learning models are trained and run on reliable, consistent input data.
Stable image quality is table-stakes for reliable CD/overlay metrology. Focus drift, stigmation, detector noise, and operating-point changes silently degrade measurements. This project provides a repeatable, quantitative way to:
- Detect IQ drift early (before it hits SPC).
- Compare tool-to-tool outputs after maintenance or recipe transfers.
- Create explainable visuals (SSIM maps, FFT energy ratios, histogram overlays) for quick, defensible decisions.

Methods

Full-reference

SSIM compares luminance/contrast/structure between reference and test images. Good for localizing degradation (SSIM maps).
PSNR summarises signal fidelity (dB). Simple but useful for quick screening.

No-reference

Normalized variance: cheap focus/sharpness proxy.
FFT high-frequency ratio: proportion of energy above a radial cutoff; drops as images lose fine detail (defocus, stigmation).
CNR (ROI-based): ( = ) over stable ROIs; separates contrast shifts vs. noise growth.

Tool-to-tool matching

Histogram comparison (e.g., Bhattacharyya/Chi-square similarity depending on your implementation) across matched ROIs or full frames. Pairs nicely with CNR to disambiguate contrast vs. noise changes.

Featured IQA results:

Gold on Carbon(GoC) image focus assesment

GOC image under inspection — GoC SEM reference image evaluated under controlled blur and rotation. The Normalized Variance (NV) focus score remains largely stable under geometric transformations, while SSIM and PSNR fluctuate strongly—highlighting NV’s robustness as a focus metric for SEM imaging..

Metric Comparison Across Image Variations

Image Variation	NV Ratio	PSNR	SSIM
Original	1.000	∞	1.000
Blurred (σ = 1)	0.844	27.361	0.762
Blurred (σ = 2)	0.690	24.378	0.620
Brightness + 20%	1.205	23.288	0.970
Contrast + 30%	0.493	25.854	0.922
Rotated 90°	0.940	12.526	0.068
Rotated 180°	1.000	12.361	0.061
Rotated 270°	0.940	12.525	0.068

Findings

Normalized Variance (NV) ratio remains relatively stable across brightness changes but decreases consistently with blur.
PSNR and SSIM are highly sensitive to brightness/contrast changes, which are common in SEM imaging but don’t necessarily indicate focus quality.
NV is rotation-invariant, providing consistent scores for rotated images.
For Gold-on-Carbon SEM images, NV provides a more consistent focus measure that aligns with human perception of sharpness.

Stigmation

IMAGE QUALITY ASSESSMENT REPORT

Distortion Type: Stigmation

1. Main Metrics Comparison

Metric	Current Value	Ideal Value	Interpretation
SSIM	0.772	1.0000	Higher is better
PSNR	23.82 dB	∞ dB	Higher is better
NormVar Focus Ratio	0.938	1.000	Closer to 1 is better
Laplacian Sharpness Ratio	0.244	1.000	Closer to 1 is better

2. Detailed Scores Breakdown

Metric	Original	Stigmation
Normalized Variance Focus	75.69	71.02
Laplacian Sharpness	8174.52	1991.11
BRISQUE Score	44.38	14.576

3. Key Findings

BRISQUE is not a suitable metric.
Norm Var : Focus quality has degraded significantly.
Laplacian Sharpness: Image sharpness has been reduced.

FFT Sharpness

Comparing sharpness using FFT( Fast Fourier Transform)
Using simulated SEM like stigmated image for FFT sharness

fft_bases dharpness — FFT-based sharpness evaluation of SEM images. The power spectrum of the sharp reference image contains prominent high-frequency energy distributed uniformly across orientations, whereas the blurred or stigmated image displays a contracted spectrum with diminished high-frequency magnitude—quantifying sharpness loss in the frequency domain.

FFT Spectrum Analysis Report

Parameter	Value
Reference Sharpness	21154.38
Test Sharpness	15844.23
Sharpness Ratio	0.749
Sharpness Difference (Test − Ref)	-5310.15
Assessment	~ Moderate focus loss

Image Histogram

Histogram compare for tool-to-tool matching

  ========================================
  HISTOGRAM REPORT
  ========================================
  correlation_score : 0.8544051386777778
  chi_square : 13.265943275829432
  hellinger_distance : 0.2348046244606338
  intersection : 10.474023181097436
  ref_brightness : 59.33144887288411
  ref_contrast : 67.01170715490987
  dis_brightness : 59.025578816731766
  dis_contrast : 64.74474942295193
  brightness_difference : 0.30587005615234375
  contrast_ratio : 0.9661707210843404
  dynamic_range_ref : 255
  dynamic_range_dis : 255
  match_quality : Good
  ========================================

Conclusion

Effective for common failure modes.
- Focus/blur & stigmation: Laplacian and FFT-ratio trend down as blur increases; SSIM agrees and localizes loss on edges.
- Noise (shot/read): SSIM/PSNR decline with higher noise; FFT-ratio can rise (noise injects high-frequency energy) while Laplacian stays relatively stable, helping distinguish noisy-sharp from clean-blurred.
- Operating-point drift: Histogram means/widths and CNR reveal gain/offset and contrast changes that don’t always show up in sharpness scores.
Actionable in production. The metrics are quick to compute, easy to threshold, and simple to visualize on dashboards. They support spot checks, lot start/end health checks, and tool matching after maintenance or recipe edits.
Auditable and transferable. All steps are documented, with deterministic preprocessing (grayscale normalization, fixed ROIs, seeds). Because the methods are standard and lightweight, they are easy to adopt across teams and tools.

Practical takeaways

Use SSIM + Laplacian + FFT-ratio together: the trio separates blur from noise and flags structural loss where it matters (edges and features).
Track histogram stats for day-to-day tool consistency and after any detector/beam parameter changes.
When a flag triggers, inspect SSIM maps and FFT views to locate the problem (focus/stigmation/operating point).