1. What does TheorChromo do?

TheorChromo is an online tool for peptide and protein retention time prediction in liquid chromatography. It implements the BioLCCC model for the most popular case of linear gradient chromatography.

2. How should I use it?

Just follow the instructions on the main page. Firstly, enter the parameters of your LC system in the corresponding lines.
WARNING: All parameters are saved between sessions.

Column length, mm
The working length of your column, i.e. the length of the column area filled with an adsorbent. Usually this number is supplied by the manufacturer.
Column internal diameter, mm
The diameter of the internal volume of the column filled with an adsorbent. This number is also supplied by the manufacturer.
Packing material pore size, A
The average linear size of a pore within a particle of a packing material. Usually it is supplied by the manufacturer of the column or the packing material (if you use in-house packed columns).
Initial concentration of component B, %
The concentration of component B in the beginning of the linear gradient.
Final concentration of component B, %
The concentration of component B at the end of the linear gradient.
Gradient time, min
The duration of the linear part of the gradient.
Delay time, min
The duration of the flushing step preceding the linear part of the gradient. This number is simply added to the calculated retention time of a peptide.
Flow rate, ml/min
The average flow rate at the linear part of the gradient.
ACN concentration in component A, %
The concentration of ACN in the component with a lesser eluting ability.
ACN concentration in component B, %
The concentration of ACN in the component with the greater eluting ability.
Solid/mobile phase combination
The type of retention chemistry being used in the experiment. Currently, the two types are supplied:
RP/ACN+FA - Magic C18 AQ reversed solid phase, ACN as an eluent and 0.1% formic acid in both solvents;
RP/ACN+TFA - SynChropak RP-P C18 column, ACN as an eluent and 0.1% trifluoroacetic acid in both solvents.

Column geometry

Linear gradient

On the next step you need to enter peptide sequences. The sequences should be written using the standard one-letter codes for the twenty proteinogenic L-amino acids and the extended notation for modified amino acids.

WARNING: Only capital letters are allowed for the standard amino acids.
WARNING: Cysteine is assumed to be pure! If the peptides were treated with iodoacetamide, check the box "Cysteines are carboxyamidomethylated" or use the corresponding notation.

A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W
standard proteinogenic amino acids
Ac-
N-Terminal acetylation
-NH2
C-Terminal amidation
camC
carboxyamidomethylated cysteine
pS
phosphorylated serine
pT
phosphorylated threonine
pY
phosphorylated tyrosine
3. What is the scientific background behind TheorChromo?

TheorChromo is based on the BioLCCC (Liquid Chromatography of Biomacromolecules at Critical Conditions) model. Contrary to the other models of peptide chromatography, BioLCCC uses only very basic assumptions regarding flexibility of a polypeptide chain, the shape of a pore, type of interactions neglected, etc. Given this assumptions, the coefficient of distribution (Kd) of a peptide between solid phase and mobile phase of a given composition is derived using the methods of statistical physics of macromolecules. Finally, the retention time of a peptide is derived from Kd using the basis equation of gradient chromatography. TheorChromo is based on the BioLCCC (Liquid Chromatography of Biomolecules at Critical Conditions) model which describes the adsorption of protein molecules on porous media. Contrary to the other models of peptide/protein chromatography, BioLCCC starts from very basic assumptions regarding flexibility of a polypeptide chain, the shape of a pore, type of interactions neglected, etc. Given this assumptions, the coefficient of distribution (Kd) of a peptide between solid phase and mobile phase can be derived using the methods of statistical physics of macromolecules. Finally, the retention time of a peptide is calculated from Kd using the basis equation of gradient chromatography.

Owing to the physical basis of the BioLCCC model, it contains very few free parameters. The retention properties of an amino acid are characterized by a single number, which is essentially the energy of interaction between the amino acid and the surface of solid phase in pure water+ion paring agent. Given this small number of phenomenological parameters, the BioLCCC model can easily be adapted for an arbitrary type of chromatography not limited by phase or solvent types. Moreover, its extension to peptides with post-translational modifications is straightforward as it was shown for the phosphorylated amino acids.

Further reading:
1. Liquid Chromatography at Critical Conditions:  Comprehensive Approach to Sequence-Dependent Retention Time Prediction, Alexander V. Gorshkov, Irina A. Tarasova, Victor V. Evreinov, Mikhail M. Savitski, Michael L. Nielsen, Roman A. Zubarev, and Mikhail V. Gorshkov, Analytical Chemistry, 2006, 78 (22), 7770-7777.

2. Applicability of the critical chromatography concept to proteomics problems: Dependence of retention time on the sequence of amino acids, Alexander V. Gorshkov A., Victor V. Evreinov V., Irina A. Tarasova, Mikhail V. Gorshkov, Polymer Science B, 2007, 49 (3-4), 93-107.

3. Applicability of the critical chromatography concept to proteomics problems: Experimental study of the dependence of peptide retention time on the sequence of amino acids in the chain, Irina A. Tarasova, Alexander V. Gorshkov, Victor V. Evreinov, Chris Adams, Roman A. Zubarev, and Mikhail V. Gorshkov, Polymer Science A, 2008, 50 (3), 309.

4. Can I use BioLCCC on my own computer?

Yes, the source code of BioLCCC is open and can be used in any free project. Please, check libBioLCCC and, if you use Python in your work, pyteomics.biolccc projects.

5. I have another question regarding BioLCCC, where can I ask it?

You are welcome to the BioLCCC discussion group: http://groups.google.com/group/biolccc