S1_Batch_Preprocessing

This page covers the necessary set up and details for each step of the batch processing workflow for Sentinel-1 data:

Directory Setup

Organizing Data & Region Coverage Clipping and Stitching

Linking Data

Preprocessing Mode I: Choosing a Reference Image

Preprocessing Mode II: Aligning to Reference Image

Directory Setup

The first thing to do to prepare for batch processing is to organize your directory structure into the following path tree, for example, a descending track:

                        [des]
                          |
  ________________________|________________________________
  |       |        |        |     |    |    |      |      |
[data] [orbit] [reframed] [topo] [F1] [F2] [F3] [SBAS] [merge]
                                  |    |    |
                          ________|_________________________
                          |     |       |       |          |
                        [raw] [SLC] [intf_in] [topo] batch_tops.config

The directories are:

[data] is where you store your .SAFE files downloaded from ASF or Copernicus

[orbit] is where you store your orbit (.EOF) files

[reframed] is where you store your coverage area pins if you want to clip or stitch frames together

[topo] is where you store your downloaded DEM file

[F1], [F2], and [F3] are your subswath-specific processing directories (where you will preprocess your data and calculate your interferograms). Each should share the same subdirectory structure.

[SBAS] is where you will eventually process your time series (at a later point)

[merge] is where you will merge and store you merged interferograms (unless you choose to process only one subswath of Sentinel-1 data over you area)

The structure of the [F1], [F2], and [F3] subdirectories should be the same in each:

[raw] is where you store your prepared raw data and orbit files, and where you will process your raw data into coregistered SLC files

[SLC] is where you can store final SLC files

[intf_in] is where interferograms will be calculated and stored

[topo] is another directory where you will store a linked copy of your DEM file

batch_tops.config: A copy needs to be stored in each F1/2/3 directory you want to process data for. This will be your main config file for interferogram processing. You can download one from the GMTSAR github if you do not have one already.

Once your DEM file is stored in des/topo/ (check out Generating_DEM to learn more), you can link it to your F1 directory:

cd F1/topo
ln -s ../../topo/dem.grd .

And do that for each F2 and F3 directory you plan to process.

Organizing Data & Coverage Clipping & Stitching

This is an optional step for anyone who wants to cover a larger area than one single Sentinel-1 frame. Skip this step if you only want to process one frame.

Choosing Stitch Points

You can choose to stitch multiple frames from the same track together to make a larger coverage area. This only works for frames within the same track (and only the same ascending OR descending track, they cannot be mixed).

First you must define your region of interest covered by the frames you want to stitch together. The most straightforward way to do this is to plot the .SAFE file preview KML files in Google Earth (Check inside your .SAFE file under preview/map_overlay.kml).

One you have those loaded, pick your stitch point (pin) locations of where you want your coverage to start and stop.

CRITICAL When you pick your pin locations, they must be in the order of the satellite data collection. For instance, for a descending track set of frames, the first lon/lat point must be the point that is closest to the north, or top of the image, while the second point must be further south of the first pin point. For ascending data, the first point must be a point closest to the south of the image.

When you pick your points, create a file called “pins.ll” and place your lon/lat locations in the correct ascending or descending order. Then place this pins.ll file into the reframed/ directory.

TIP: If you are trying to cover a large area with many frames, you may need multiple stitch points (a rule of thumb is make sure to have one approximately every ~150 km).

TIP: Pay attention to the projected bounds of your coverage. To pick a working pin location, it must be in a position that has data across the latitude along the entire data file (i.e., if you pick a pin location on a corner of the image it may not be successful unless the corner lines up with available data)

Running organize_files_tops

The tool that does the actual stitching of frames together is called organize_files_tops.csh. Note that there is both a unix and a linux version (organize_files_tops_linux.csh)

This script will download the precise orbit files for you if you have not previously downloaded them.

Inside the data/ directory:

# make a list of SAFE files with the absolute path
ls -d $PWD/*.SAFE > SAFE_filelist

organize_files_tops.csh SAFE_filelist ../reframed/pins.ll 1 >& oft_mode1.log &

# or, if you are on a linux machine:
organize_files_tops_linux.csh SAFE_filelist ../reframed/pins.ll 1 >& oft_mode1.log &

# optional: to watch the screen output, tail the log file
tail -f oft_mode1.log

# when finished, check the log file for any errors

Mode 1 of organize_files_tops.csh will go through and make sure you have orbit files and will tell you which dates of SAFE files it will be able to re-organize and stitch together for you. Mode 2 is the part that actually completes the stitching.

To run mode 2:

organize_files_tops.csh SAFE_filelist ../reframed/pins.ll 2 >& oft_mode2.log &

# or, if you are on a linux machine:
organize_files_tops_linux.csh SAFE_filelist ../reframed/pins.ll 2 >& oft_mode2.log &

Once this is done running, there should be a new directory starting with F????_F???? (there may be multiple new directories). Inside these new directories are your stitched .SAFE files. You can move all files to a single F????_F???? directory if desired.

Common Errors/Mishaps:

Not using absolute path in the list of SAFE file names – you need absolute path, not relative path

Stitch points/pins are backwards or not placed correctly (this may manifest as the log file telling you it cannot stitch the frames together)

Precise orbits may not be available yet. If you want to process with restituted orbits instead, download them manually, place them in the reframed directory and then run this command for each SAFE file you have that requires a restituted orbit:
create_frame_tops.csh "SAFE_file_absPath" "matching restituted orbit name"

Then you can move these final files to the same F????_F???? directory as your other .SAFE files.

Linking Data

The next step in preprocessing is to link your data to the raw directory for processing.

For those interested in processing all three subswaths (F1,F2,F3), complete this step in each directory. For those interested in just processing one or two subswaths, just complete this in the desired subswaths directories.

# For users who stitched frames together:
cd F1/raw
ln -s ../../data/F*/*.SAFE/*/*iw1*vv*xml .
ln -s ../../data/F*/*.SAFE/*/*iw1*vv*tiff .

# For user who did not do any stitching, omit the stitched directory like so:
ln -s ../../data/*.SAFE/*/*iw1*vv*xml .
ln -s ../../data/*.SAFE/*/*iw1*vv*tiff .

# Then all users should link orbit files and their DEM file:
ln -s ../../data/*.EOF .
ln -s ../topo/dem.grd .

NOTE: “iw1” files correspond to the F1 subswath. For F2, change the above to “iw2” and for F3, “iw3”.

Preprocessing Mode I

The first mode of preprocessing your Sentinel-1 data is to calculate and plot a baseline diagram showing the placement of all your chosen input images in your batch. To accomplish this, you first need to create a data.in file, which is the input file for preproc_batch_tops.csh . All preprocessing occurs in the specific subswath directories you plan to process (e.g., if you are processing all three subswaths, you need to preprocess each subswaths’s raw data separately).

As an example, start in the F1 directory and go to the raw/ directory where you just linked all your data.

cd F1/raw

Creating data.in

The data.in file is a list of SAR tiff image namestems, listed with their corresponding orbit files and separated by a colon. If you have done any stitching of multiple frames together, each frame that has been stitched needs to be listed. In addition, you make a separate data.in file for each subswath you are processing.

For example, for one frame of data, the first few lines of data.in file will look like:

s1a-iw1-slc-vv-20180207t043037-20180207t043048-020496-0230e3-005:S1A_OPER_AUX_POEORB_OPOD_20180227T120553_V20180206T225942_20180208T005942.EOF
s1a-iw1-slc-vv-20180219t043037-20180219t043048-020671-023679-005:S1A_OPER_AUX_POEORB_OPOD_20180311T120551_V20180218T225942_20180220T005942.EOF
s1a-iw1-slc-vv-20180303t043037-20180303t043048-020846-023c05-005:S1A_OPER_AUX_POEORB_OPOD_20180323T120847_V20180302T225942_20180304T005942.EOF

while a data.in file with data that has been stitched together will look like:

s1a-iw1-slc-vv-20150911t014915-20150911t014946-007661-00aa13-004:s1a-iw1-slc-vv-20150911t014943-20150911t015013-007661-00aa13-004:s1a-iw1-slc-vv-20150911t015011-20150911t015024-007661-00aa13-004:S1A_OPER_AUX_POEORB_OPOD_20151001T122318_V20150910T225943_20150912T005943.EOF
s1b-iw1-slc-vv-20181230t014851-20181230t014922-014265-01a882-004:s1b-iw1-slc-vv-20181230t014919-20181230t014949-014265-01a882-004:s1b-iw1-slc-vv-20181230t014947-20181230t015000-014265-01a882-004:S1B_OPER_AUX_POEORB_OPOD_20190119T110617_V20181229T225942_20181231T005942.EOF
s1b-iw1-slc-vv-20190111t014851-20190111t014921-014440-01ae28-004:s1b-iw1-slc-vv-20190111t014919-20190111t014949-014440-01ae28-004:s1b-iw1-slc-vv-20190111t014946-20190111t014960-014440-01ae28-004:S1B_OPER_AUX_POEORB_OPOD_20190131T110643_V20190110T225942_20190112T005942.EOF

You can either make your own data.in file, or you can use the tool prep_data.csh or prep_data_linux.csh which will only work for precise orbits (resituted orbits, if used, will need to be added manually).

For example:

# inside the F1/raw directory, with all data, orbits, and DEM files linked properly
prep_data.csh

Next, use this data.in file to preprocess in mode 1.

Running preproc_batch_tops.csh

From the raw directory, we now want to calculate the temporal and spatial baselines of the images so that we can choose a reference image for our batch processing/time series processing. Mode 1 of preproc_batch_tops.csh does this calculation and produces a baseline_table.dat and a baseline.ps file as a preparation step.

To run, make sure you are still in F1/raw (or whichever subswath you are processing):

# This runs in the background and saves screen output to a log file
# use "tail -f pbt_mode1.log" to watch the screen out put as it prints

preproc_batch_tops.csh data.in dem.grd 1 >& pbt_mode1.log &

Once Mode 1 has finished, you can open the baseline.ps file to see the distribution of baselines of your images. To choose a good reference image for your entire batch of images, pick one of the dates in the middle of the distribution of image dates.

To set your chosen date as your reference image, you then need to manually open the data.in file and move the line that contains that image date to the top of the data.in file list. This way, the mode 2 program will know to align all of your images to that chosen reference image.

Preprocessing Mode II

Once you have chosen a reference image, and have moved that image line in data.in to the top of the file you can now run preproc_batch_tops.csh with mode 2, which will actually do all the alignment and image co-registration.

# This runs in the background and saves screen output to a log file
# use "tail -f pbt_mode2.log" to watch the screen out put as it prints
preproc_batch_tops.csh data.in dem.grd 2 >& pbt_mode2.log &

NOTE Preprocessing with Mode 2 will take much longer than mode 1. For ~30 images it may take roughly 1-3 hours per F? subswath, depending on the processing power of your machine.

IF YOU ARE PROCESSING MULTIPLE SUBSWATHS Remember to go back and complete the preproc_batch_tops.csh process (mode 1 and mode 2) in each subswath raw/ directory. You will need to select the same reference image date for each subswath (and it needs to be promoted to the first line in the specific subswath’s data.in file). Don’t be tempted to use the same data.in file for each subswath, they are not interchangeable.

Re-creating baseline_table.dat

If you find you need to re-create your baseline_table.dat file, you can use the get_baseline_table.csh.

Once you are done with preprocessing, you can start calculating your interferograms! Check out S1_Batch_Interferograms