This page describes some problems you may encounter when trying to import files with Phred qualities into Sequencher version 4. For general instructions on importing Phred qualities into Sequencher, click here. Sequencher is a leading tool for contig editing and assembly developed by Gene Codes Corporation.
A number of things can go wrong when trying to import qualities from a Phred-generated .fasta or .fasta.qual file. These include:
Some of the most common reasons for these problems are:
A detailed description of these problems is given below. For the impatient, here are some guidelines to follow to avoid the common problems:
Sequencher appears to have problems when trying to import sequences and trace files which have long file names.This is a confusing problem, because Sequencher often does not give any error messages - just some things are wrong after importing. These can include:
You can easily reproduce (or eliminate) this problem by renaming files before the Phred analysis.In our tests, the same files were imported without problems when the name (with extension) was 31 characters or less, but not if the names were 32 characters or longer. (This problem may be related to the maximum length of file names on the Classic MacOS).
Keep in mind that Windows may not show you the entire file name - the extension may be hidden, depending on the Explorer settings and the extension. To be safe, keep file names at 27 or fewer characters (to allow for the ".scf", ".ab1" etc. extensions) - shorter is safer.
This seems to be a bug in Sequencher, and we have not found a workaround other then renaming the files before analysis with Phred. Please contact GeneCodes with any comments about this.Unfortunately, it also seems to be a common problem, since the default settings on the ABI 3700 tend to produce very long file names.
Sequencher tries to automatically find the chromatogram files that belong to a sequence entry in the FASTA (or FASTA.QUAL) file. Apparently, Sequencher always uses the base calls in the chromatogram file, and the quality scores in the FASTA.QUAL file. Since Sequencher does this without any consistency checks, Sequencher may assign quality scores to the wrong bases! This can happen if:
Furthermore, it seems that Sequencher completely ignores the base calling information in the .phd files, which contain the current base calls.
One more problem can that can make it impossible to import Phred qualities: if the directory above the .FASTA and the .QUAL file contain the original ABI files, then Sequencher will (sometimes) give an error message:
File Error! (10000)
Some kind of error occured while attempting to read from or write to
a disk. Therefore your selected command failed.
If you delete, rename, or move the files in the parent directory, the import works fine.
The suggested directory structure and folder contents is as follows (assuming that your input files are in C:\MyProject\)):
Folder | Contents | Example |
---|---|---|
C:\MyProject\ | Your input files | MySequence1.ab1 MySequence2.ab1 |
C:\MyProject\phred_made\ | The root directory for all files created by Phred. | chromat_dir (a directory) edit_dir (another directory) phd_dir_dir (a third directory) |
C:\MyProject\phred_made\edit_dir | The fasta and fasta.qual files created by Phred or phd2fasta | MyProject.fasta MyProject.fasta.qual |
C:\MyProject\phred_made\chromat_dir | The SCF files create by Phred. The SCF files have exactly the same name as the input files. |
MySequence1.ab1 MySequence2.ab1 |
C:\MyProject\phred_made\phd_dir | The phd files create by Phred. |
MySequence1.ab1.phd.1 MySequence2.ab1.phd.1 |
Tip: Spaces in file an folder names often can cause problems. If possible, avoid all spaces and other "funny characters" in file and folder names (that also means you should not save files in your "My Documents" folder!).
You will get the directory structures above if you use the "Process: Consed format" or the "AutoPhred and Analysis" options in InterPhace, version 2.2.3 or newer (older InterPhace versions do not create the extra directory ("phred_made") in "AutoPhred", and can the results may not import well into Sequencher; but you can always move things around by hand, if you need to).
Sometimes, the qualities shown in Sequencher don't make sense. This is typically a result of problems resulting from long filenames or wrong directory structures, as discussed above.
One common mistake is to run Phred to only create FASTA and quality files, but not SCF (chromatogram) files. Sequencher will read the quality file, and then open the original (ABI etc.) trace files. But the base calls in the ABI files and those made by Phred will be different! Sequencher will not complain about this - it will simply assign quality values to the wrong bases without any warning. If you suspect this is the case, you can look at the .phd files, using a text editor like WordPad. A few lines down in the files, you will see the base calls made by Phred, and the associated qualities and positions in the trace. Typically, the first few bases are already different from those called by the ABI software and shown in Phred.
Note that problem is not limited to ABI, it can happen with other input files, too. You can always use InterPhace or TraceViewer to check if a trace file has Phred qualities by opening the file in TraceViewer or InterPhace: Phred-generated SCF files will show quality bars at the top, ABI files will not.
If you see this problem, simply re-process the files with Phred, making sure to create the correct directories, and then import the newly create files.
Some users have experience the following problem: after importing sequences from Phred-generated FASTA files, the file names of all sequences are wrong, and the sequences do have qualities (no colored backgrounds). The names of the imported sequences are often like "seq.fasta #1", "seq.fasta #2), and so on.
A related problem is that Sequencher does not import anything from Phred-generated quality (.qual) files.
To fix this problem, do the following:
CodonCode Home - CodonCode Support -More about Phred-Phrap: Phrap.com
© Copyright 2000-2003 CodonCode Corporation. All rights reserved.