File Format Guide

Files are available in a variety of formats. Some data collections are available in more than one format. The GLCF aims to provide spatial data in standard formats, though legacy and special collections may retain others. This guide is provided to explain how users can identify and utilize these formats. Anyone with questions may contact the GLCF directly. Note that some file formats require the presence of multiple files. For example, one file will contain the actual image, another, often called the 'header' will contain details about the location and size of the image, and another, often called the 'metadata', may contain more detailed explanation of the image origins, processing, size and location. Some formats will contain everything internally, but even they may have exceptions. Be careful to grab all files needed to operate a single image.

Translate the file name

File names are standardized within collections whenever possible. Where imagery or products are generated externally, the GLCF has attempted to retain the naming standard of origin. Internally generated products are provided a standard name appropriate to the collection. Most file name reference the file's format. Please refer to the GLCF's Naming Conventions Format Guide to locate the file format reference contained within names.

Identify the file format types used at GLCF

Primary file formats

Format Translation Extensiont Comment
GeoTIFF Geostationary Earth Orbit Tagged Image File Format .tif a TIFF with location
BSQ Band SeQuential .bsq binary
HDF Hierarchical Data Format .L1G NASA
Fast fast .fst Landsat
Arc Grid Arc/Info Interchange .e00 ESRI
Mr.Sid Mr.Sid .sid proprietary
Shapefile Shapefile .shp vector standard

Secondary files

Extension Description
.dbf dBase database file, accompanies shapefiles; by ESRI
.gz compressed using Gzip, by GNU Zip
.jpg Joint Photographic Experts Group image file (non-spatial)
.png Portable Network Graphics image file (non-spatial)
.prj text file, with projection details
.shx spatial index file, accompanies shapefiles; by ESRI
.sbx spatial index file, accompanies shapefiles; by ESRI
.sbn spatial index file, accompanies shapefiles; by ESRI
.sid Mr.Sid compressed image file, an option with Landsat mosaics
.tfw world file, sometimes accompanies GeoTIFF

Metadata files

Extension Collection(s)
.met Landsat GeoCover, Landsat GeoCover Mosaics, GLS 1990- Landsat TM, GLS 2000- Landsat ETM+
.hdr Landsat , IKONOS
.cof Landsat
.ddr Landsat MSS
.hd Landsat NLAPS
.H1 Landsat MSS
.HI Landsat; processing history
.ip3 Landsat
HPN.fst Landsat Fast
HRF.fst Landsat Fast
HTM.fst Landsat Fast
HDF.L1G Landsat HDF
MTL.L1G Landsat BSQ and HDF
MTL.L1G Landsat BSQ and HDF
.wo Landsat NLAPS
.dif various, for GCMD
.hdf.met ASTER
.txt MODIS VCF, UMD Land Cover, Tree Cover
.metadata.txt IKONOS
.RPB QuickBird
.TIL QuickBird
.IMD QuickBird
.PVL OrbView
MTL.TIF GLS 2005- Landsat ETM+
MTL_L1T.TIF EO-1 ALI

Metadata files

GeoTIFF HDF BSQ Software Company
ArcExplorer ESRI
tutorial tutorial tutorial ArcGIS 9.x ESRI
tutorial tutorial tutorial ArcView GIS 3.x ESRI
BSQ ER Mapper Geospatial Imagery Solutions
tutorial tutorial tutorial Envi 4.x ITT Visual Information Solutions
tutorial tutorial tutorial IDRISI Andes Clark Labs
ImageJ US NIH RSB
tutorial tutorial tutorial Imagine 9.x Leica
MultiSpec Purdue Research
Foundation
MyWorld GIS GEODE, Northwestern
Photoshop Adobe

Details about the GeoTIFF format

GeoTIFF data is one of the most widely supported spatial data formats. GLCF uses this as a standard format, especially suited for imagery and other raster spatial files. GeoTIFF is a public domain format originated by Dr. Niles Ritter where basic TIFFs (Tag(ged) Image File Format) utilize geospatial tags imbedded within the TIFF file. Most remote sensing or GIS software will read these internal tags and automatically import a file's spatial coordinates, as well as any additional information (map projection, datum, byte order, etc.) which may be found in the tag. Software packages such as Photoshop or Microsoft Word which do not utilize spatial information will still be able to read in GeoTIFFs as normal TIFFs (in some cases support for TIFF 6.0 is required). In general the GeoTIFF format is also platform independent. Essentially, GeoTIFF is a TIFF file with location.
Imagery in this format use the industry-wide TIFF image specification from Adobe and include a special header for storing georeferencing information. Scenes in GeoTIFF format can easily be read into any image software package that recognizes TIFFs and those that can understand the GeoTIFF-specific header can also automatically georeference the image. More information about GeoTIFF can be found at the GeoTIFF site at remotesensing.org.

Details about the L1G/HDF/BSQ format

HDF is a format created by NASA for imagery and other raster spatial data. L1G is a level of processing for Landsat imagery that indicates the imagery is ready to use. L1G translates as "Level 1G", meaning the data has been processed to level 1 and is radiometrically and geometrically corrected. Since most Landsat 7 data files from the USGS include the "L1G" extension, the GLCF maintains this naming convention. Please note, however, that "L1G" indicates the processing level and not the actual file format.
There are several versions of HDF. At the GLCF, L1G data is often in the "Unwrapped HDF" format. The GLCF provides each band of an image as a separate file, so the bands of each scene can be considered "Unwrapped" files, versus one file containing all bands. A special file, ending with "_HDF.L1G" accompanies each Landsat scene available in this format. Some spatial software will read the _HDF.L1G file accompanying a scene and use the information therein to handle the image as a single file, even though the reality is that the files are each distinct. Software reading HDF format will allow users to load the scene as a whole, or bands as separate entities. It is essential that the user copy both the band specific file plus the _HDF.L1G file.
If spatial software will not read the _HDF.L1G file, the user may use the .L1G files as separate entities by treating them as BSQ files. This is the Band SeQuential binary format, which most spatial software can easily read. The requirement for this format, however, is that the user enter all the header information needed by the software to correctly display and locate the image. The software will prompt the user for details about the scene's corner coordinates, projection, datum, pixel size, number of rows and number of columns. To locate this information, open the metadata file appropriate to this scene (usually the "_MTL.L1G" for L1G files) in a text editor and copy the information.

Details about the BSQ format

This format stores imagery as a flat binary file with header information located in a separate metadata file. Although a wide variety of header files exist; software packages may or may not be able to read the header files to get the image dimensions or georeferencing information. If the software package cannot understand the header file, this information must be entered in manually.
BSQ, or "Band Sequential" format, is a popular standard in image formats. Each image band is written as one independent file and, as such, BSQ can be a preferable format insofar as one does not have to read past ancillary data in an image stack. As opposed to formats such as a multi-band GeoTIFF (where all the bands are in one file), BSQ data sets give the user the option of only downloading the relevant bands.

Details about the Fast format

The Fast Format is an example of the BSQ format with a header file, independent bands followed by "End of File" markers, followed, in turn, by an "End of Volume" marker.

Details about the Mr.Sid format

This format is proprietary to LizardTech, where imagery is compressed but easily displayed. Some spatial software does not support this image format, though a free viewer can be downloaded from LizardTech.


The Global Land Cover Facility
University of Maryland, Copyright 2006

Quick Links
Copyright © 1997-2016 University of Maryland. All rights reserved
University of Maryland National Aeronautics and Space Administration GOFC_GOLD