Proteomics - National Center for Research Resources
Search NCRR

Software and Tools

Concatenated Text File Splitter

Our current solution to data archival includes concatenation of the tandem mass spectral data files produced during SEQUEST analysis (so-called "dta files" by virtue of their extension) into a single text file. This program splits these concatenated files into separate files that can then be searched via SEQUEST or other software that expect .Dta files as input. We also routinely concatenate SEQUEST analysis output files (so-called "out files") in a similar fashion. This program will split these archived results as well.

Please see the Command Line Application Help page for additional information on running this program at the Windows command prompt.

Download Software Tool Download Source Code  

Version v1.0.2946 Requirements Microsoft NET Framework 1.1
Date Updated January 25, 2008 File Size (Software Tool) 166 KB (ZIP)
Registration Required No File size (Source Code) 81 KB (ZIP)
Developers Matthew Monroe
Comments The program no longer requires any external DLLs. It also now allows the input file to be specified without using /I: at the command line
See also the complete Revision History


Example Concatenated Data File

The following is an excerpt from an example _dta.txt file, which was created from a series of .dta files. The Concatenated Text File Splitter program can parse this file, looking for the header lines, and create separate .dta files for each section in the concatenated file. Note that the "scan=5563 cs=2" portion of the parent ion info line is a PNNL-specific add-on to .Dta files (see Matrix Science's Data File Format page for additional information).

=================================== "QC_File_30Aug07_Owl_07-08-03.5563.5563.2.dta" ==================================
998.62505 2   scan=5563 cs=2
227.045 7973.5
303.314 1632.7
326.17 1130.8
386.744 2049
455.543 1544.5
467.954 1080.8
477.865 1176.2
480.34 997.3
481.377 1102.3
489.778 1502.3
544.336 1376.8
673.386 5770.5
772.459 45675.8
773.538 5369.5
852.318 2238.1

=================================== "QC_File_30Aug07_Owl_07-08-03.5564.5564.1.dta" ==================================
1057.584 1   scan=5564 cs=1
509.314 816.6
527.214 831.2
640.366 1643.2
741.457 758.5
759.412 8421.5
790.407 656.2
807.373 1387.5
808.428 1746.3
825.351 3142.8
872.401 829.8
903.447 1003.2
904.444 653.1
920.483 1054.7
921.478 3488.3
938.483 4280
1039.495 2698.9

=================================== "QC_File_30Aug07_Owl_07-08-03.5565.5565.2.dta" ==================================
.
.
.