Measuring Line Edge Roughness: Fluctuations in Uncertainty



Similar documents
Using the Normalized Image Log-Slope

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

Confidence Intervals for One Standard Deviation Using Standard Deviation

Using the Normalized Image Log-Slope, part 3

Jitter Measurements in Serial Data Signals

Improving Chromatic Dispersion and PMD Measurement Accuracy

Three-dimensional vision using structured light applied to quality control in production line

Computing the Fractal Dimension of Stock Market Indices

The Calculation of G rms

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

EXPERIMENTAL ERROR AND DATA ANALYSIS

New 3-Dimensional AFM for CD Measurement and Sidewall Characterization

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

Basics of Statistical Machine Learning

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

4. Continuous Random Variables, the Pareto and Normal Distributions

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Making Accurate Voltage Noise and Current Noise Measurements on Operational Amplifiers Down to 0.1Hz

Identification of Demand through Statistical Distribution Modeling for Improved Demand Forecasting

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

ANALYZER BASICS WHAT IS AN FFT SPECTRUM ANALYZER? 2-1

Moving averages. Rob J Hyndman. November 8, 2009

Terms concerned with internal quality control procedures

Microeconomic Theory: Basic Math Concepts

Sheet Resistance = R (L/W) = R N L

Signal to Noise Instrumental Excel Assignment

Interferometric Measurement of Dispersion in Optical Components

Determination of g using a spring

Understanding Poles and Zeros

Time Series and Forecasting

Auto-Tuning Using Fourier Coefficients

Introduction to Fourier Transform Infrared Spectrometry

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

MODULATION TRANSFER FUNCTION MEASUREMENT METHOD AND RESULTS FOR THE ORBVIEW-3 HIGH RESOLUTION IMAGING SATELLITE

Characterizing Digital Cameras with the Photon Transfer Curve

Review of Fundamental Mathematics

Synthetic Sensing: Proximity / Distance Sensors

Confidence Intervals for Cp

Industry Environment and Concepts for Forecasting 1

The Viscosity of Fluids

Experiment 5. Lasers and laser mode structure

Descriptive Statistics

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1

Statistics, Probability and Noise

Chapter 4 One Dimensional Kinematics

Determination of source parameters from seismic spectra

Determining Measurement Uncertainty for Dimensional Measurements

AP Physics 1 and 2 Lab Investigations

On Correlating Performance Metrics

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Control Charts for Variables. Control Chart for X and R

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

An Introduction to. Metrics. used during. Software Development

THE SIX SIGMA BLACK BELT PRIMER

CALCULATIONS & STATISTICS

Investment Statistics: Definitions & Formulas

Physics Lab Report Guidelines

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

Introduction to Digital Audio

3D TOPOGRAPHY & IMAGE OVERLAY OF PRINTED CIRCUIT BOARD ASSEMBLY

Dimensionality Reduction: Principal Components Analysis

Determining the Acceleration Due to Gravity

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

STATISTICAL REASON FOR THE 1.5σ SHIFT Davis R. Bothe

Regression III: Advanced Methods

Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm

Big Ideas in Mathematics

Geostatistics Exploratory Analysis

2. Simple Linear Regression

Forecasting in supply chains

Total Hot Spot Management from Design Rule Definition to Silicon Fabrication

1. How different is the t distribution from the normal?

Dimensional analysis is a method for reducing the number and complexity of experimental variables that affect a given physical phenomena.

Determining Polar Axis Alignment Accuracy

Lecture - 4 Diode Rectifier Circuits

In order to describe motion you need to describe the following properties.

SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS. J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID

Standard Test Method for Classification of Film Systems for Industrial Radiography 1

The Fundamentals of MTF, Wiener Spectra, and DQE. Motivation

Figure 1. A typical Laboratory Thermometer graduated in C.

Confidence Intervals for Exponential Reliability

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

The Fourier Analysis Tool in Microsoft Excel

Fundamentals of modern UV-visible spectroscopy. Presentation Materials

10.7 Kinetic Molecular Theory Kinetic Molecular Theory. Kinetic Molecular Theory. Kinetic Molecular Theory. Kinetic Molecular Theory

CHAPTER 5 Round-off errors

The Method of Least Squares

Simple Regression Theory II 2010 Samuel L. Baker

Sample Size and Power in Clinical Trials

College on Medical Physics. Digital Imaging Science and Technology to Enhance Healthcare in the Developing Countries

Spectral Line II. G ij (t) are calibrated as in chapter 5. To calibrated B ij (ν), observe a bright source that is known to be spectrally flat

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

Radiography: 2D and 3D Metrics of Performance Towards Quality Index

VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS

Introduction to X-Ray Powder Diffraction Data Analysis

PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUMBER OF REFERENCE SYMBOLS

Least Squares Estimation

Utilization of AIMS Bossung plots to predict Qz height deviations from nominal

Transcription:

Tutor6.doc: Version 5/6/08 T h e L i t h o g r a p h y E x p e r t (August 008) Measuring Line Edge Roughness: Fluctuations in Uncertainty Line edge roughness () is the deviation of a feature edge (as viewed top-down) from a smooth, ideal shape that is, the edge deviations of a feature that occur on a dimensional scale smaller than the resolution limit of the imaging tool that was used to print the feature []. Line width roughness (LWR) is defined similarly. As one might expect, the same measurement tools and techniques used to measure the width of a feature are often used to measure its roughness, with the scanning electron microscope (SEM) being the most common tool. Unfortunately, doing a good job of measuring roughness is extremely difficult and fraught with pitfalls. Very different approaches are required for measuring the roughness of a feature compared to measuring the width of that feature. First, measuring the uncertainty in an edge is a much more noisy exercise than measuring edge itself. Assuming only (normally distributed) random errors in edge position measurement, for n measurements of an edge (or width) used to determine its roughness, the relative uncertainty in the measured roughness (with roughness expressed as the standard deviation of the edge position, σ ) can be estimated as σ σ n () To measure the to 0% precision, 50 measurements of edge positions must be made. While this may seem like a lot of measurements, there is any easy way to get many more than this. By taking an image of the rough feature (Figure ) and processing it off-line to determine edge position, it is very easy to extract hundreds of edge deviations from this one image and thus calculate σ with sufficient precision. But while the precision of an measurement can be easily quantified and controlled, how about the accuracy of the measurement? In particular, is such a measurement unbiased, meaning that as the precision error of the measurement goes to zero, does the measured edge deviation (σ measured ) approached the actual edge standard deviation (σ )? In general the answer is no. There are two main sources of bias (also called systematic error) commonly encountered in measurement: SEM noise, and the measurement s finite frequency content. The difficulty in measuring uncertainty (such as the uncertainty in a feature edge) is that any uncertainty in the measurement itself (such as noise in the SEM signal) adds in quadrature to the thing being measured. Unlike measuring the mean value (such as measuring the feature width), errors don t cancel on average when making many repeated measurements. As a result, the measured roughness is biased it appears greater than the actual roughness []:

measured SEM noise σ = σ + σ () SEM noise comes from the limited number of electrons detected and shows up as the noisy snow that effectively defines the look and feel of an SEM image. Low electron dose, defocus of the beam spot, and stigmatic error all contribute to increased SEM noise. Optimal SEM operation results in σ SEM noise not much lower than nm [3], which is on the order of the roughness of many of the features being measured. When measuring feature critical dimension (CD), SEM noise is often dealt with by pixel averaging (binning). Such image smoothing techniques, however, smooth out the true roughness in the image just as much as the SEM noise. The best way to correct for measurement noise bias is to turn off all pixel smoothing algorithms and instead carefully measure the SEM noise directly (preferably on the exact sample of interest) and subtract it from the measured roughness value using equation () []. Although it may involve taking multiple images of the same sample (thus making measurements slow), the results can be made as accurate as patience allows. Unfortunately, since 93 nm resists react physically to electron exposure producing resist shrinkage, they have strict limits on the total electron dose that can be tolerated. The second source of bias in measurements tends to always make the measured less than the actual. The fundamental roughness of a feature edge has not only a magnitude but a frequency content as well. For a measured edge deviation (y), where y represents the different points along the line where the edge is measured, its spatial frequency behavior is best captured by examining its Fourier transform. A very convenient way of viewing this frequency content is through the Power Spectral Density (PSD), defined as the square of the magnitude of the edge deviation Fourier transform. iπ f y PSD( f ) = ( y) e dy () (Similarly, the LWR PSD can be defined in the same way.) The PSD shows the square of the roughness (that is, the edge variance) per unit spatial frequency. For a type of roughness called white noise, the PSD is a constant for all spatial frequencies. But resist roughness is not white noise, and the behavior of the PSD as a function of spatial frequency can be very revealing. The interesting behavior comes from the fact that a given point on the edge is correlated with nearby points, but not with points far away. The correlation of different edge points is described by their correlation function R(τ): R( τ ) = lim ( y) ( y + τ ) dy (3) L L L / L / where τ is the distance between two points along the line. The PSD is the Fourier transform of the correlation function (under some constraints that are thought to apply here).

A very simple and common model for roughness correlation is to assume that very close points are perfectly correlated, and then the degree of correlation falls off exponentially with distance. e ( τ / ) α L c R ( τ ) = σ (4) where σ is the magnitude of the, L c is called the correlation length, and α is called the roughness exponent (also called the Hurst exponent). Correlation lengths for 93 nm resists have been reported to be in the range of 0 00 nm. For the special case of α = 0.5, the PSD can be calculated analytically [4]: σ L PSD( f ) = (5) + (π f L A plot of equation (5) is shown in Figure. At frequencies below /πl c (length scales larger than the correlation length), the PSD becomes flat (white, uncorrelated noise). Above this frequency, the PSD falls off as /f (fractal behavior). By Parseval s theorem, the area under the PSD curve must equal variance ( σ ), and integrating equation (5) over all frequencies does give this result. But when measuring, only a certain range of frequencies can be covered. In particular, the minimum possible frequency sampled by the measurement is determined by the maximum distance between measurement points along the line (that is, by the size of the measurement box, L box ). L box c c ) π fmin = (6) Any low-frequency roughness at frequencies less than this minimum will not be seen by the measurement. As a result, the roughness measurement will be biased, producing a value less than the actual roughness. Using the model PSD of equation (5), the impact of missing these low frequency components can be calculated. L σ = = c measured PSD( f ) df σ tan (7) π f Lbox min If L box is chosen to be much greater than the correlation distance L c, the measured roughness approaches the actual value. Figure 3 shows a plot of the square root of equation (7), something often called a σ(l) curve [5]. The measured value of or LWR approaches the actual value as the length of the line being measured approaches infinity. But for finite (and thus real) measurement lengths, the measured roughness is always less than the actual roughness. According to this simple model, when the measurement box is ten times the correlation length, the measured roughness is about 3.% too low. If the measurement length is reduced to five

times the correlation length, the measured roughness will be systematically low by about 6.5%. It is important to note that the amount of bias encountered for a given L box depends on L c, and is thus resist and resist process dependent. Measuring has two common systematic errors. SEM noise biases the measurement upward, and can only be corrected by characterizing the noise and subtracting it out. Smoothing algorithms typically employed for improved edge detection during CD measurement should never be used when measuring LWR or. Second, the finite frequency range of measurements means that very low frequency roughness will not be captured, biasing the measurement downward. This bias can be reduced by using very long measurement lengths, or by characterizing the correlation length and correcting the measurements using equation (7). Unless both of these biases are controlled and corrected for, any given measurement may be higher or lower than the actual roughness, possibly by a large amount. And since the biases are dependent both on the SEM (tool and measurement process) and on the resist (material and resist process), comparing biased measurements is extremely problematic. Most of the literature provides no description of how is measured, and thus whether these biases have been accounted for. It s no wonder that the industry has been so slow in understanding trends and mechanisms. References. Chris A. Mack, Fundamental Principles of Optical Lithography: The Science of Microfabrication, John Wiley & Sons, (London: 007).. J. S. Villarrubia and B. D. Bunday, Unbiased Estimation of Linewidth Roughness, SPIE Vol. 575 (005) pp. 480-488. 3. B. D. Bunday, et al., Determination of Optimal Parameters for CD-SEM Measurement of Line Edge Roughness, SPIE Vol. 5375 (004) pp. 55-533. 4. Y. Ma, H. Levinson, and T. Wallow, Line Edge Roughness Impact on Critical Dimension Variation, SPIE Vol. 658 (007) pp. 6584-6584-. 5. L. H. A. Leunissen, W. G. Lawrence, and M. Ercken, Line edge roughness: experimental results related to a two-parameter model, Microelectronic Engineering, Vol. 73-74 (004) pp. 65-70.

Figure., as described by the three sigma deviation of an edge from a straight line, is measured with high precisions by capturing the image of a long segment of a line.

000 00 PSD (nm 3 ) 0 0. 0.0 0. 0 00 000 Spatial Frequency f (µm - ) Figure. An idealized PSD with σ =.5 nm and L c = 5 nm.

..0 σ meas /σ 0.8 0.6 0.4 0. 0.0 0 4 6 8 0 L box /L c Figure 3. A plot of the measured relative to the actual value as a function of the size of the measurement box used. The measured value approaches the actual value as the length of the line being measured approaches infinity.