The SYBYL Data Structure

Molecule Areas

The data structure in SYBYL is called the "molecule description". Molecule descriptions are are stored in each of SYBYL's "molecule areas" (M1, M2,...):

The number of molecule areas that can be used simultaneously depends on the amount of computer memory available.

Items in the molecule description

Special Considerations for Biopolymers

The molecule description input for biopolymers is handled somewhat differently than for other kinds of molecules. Biopolymers can be built from scratch using the "Biopolymer" option of SYBYL. The primary structure is specified from a list of monomer types, and the three-dimensional structure is automatically generated. The complete molecule description (monomer by monomer), in this case, is obtained from a "dictionary":

Monomer units are configured for use in "polymerization" building by virtue of the fact that their linkage atoms (e.g. N- and C-terminii of a peptide) have unfilled valence sites. The linkage atoms are identified in the various dictionaries.

The major drawback of the dictionary approach is that a limited number of pre-defined monomers is available. Note that the "BIGPRO" dictionary contains the majority of non-standard amino acids, as well as all blocking and end groups. You must generate molecule descriptions for missing monomers needed to build your biopolymer, and then enter that information in an appropriate dictionary.

Alternatively, biopolymer structures can be input from a PDB file. The dictionary is used, in this case, to assign everything but the atomic coordinates (which are taken from the PDB information). Some PDB files include a mixture of biopolymers and small molecules (e.g. a ligand). The small molecule description is automatically assigned by SYBYL, but often with poor accuracy (particularly atom type codes).

The Atom Expression

The atom expression contains a list of the atom id's of the atoms selected for graphical and computational operations. The atom expression can consist of a single atom or a collection of atoms that share a particular relationship (e.g. all in the same monomer). Atom expressions can be defined graphically or by command input. SYBYL provides a standard graphical interface for defining atom expressions:

The interface offers a variety of ways to define atom expressions based on atom, bond, or monomer selections. Atom selections used to define an atom expression can be input by clicking on the desired atoms in the corresponding molecule drawing or by choosing from any existing sets. The atom selection mode of the atom expression interface is shown below:

In bond selection mode, the atom expression contains one or more sets of bonded atoms. The bond selection mode of the atom expression interface is shown below:

In monomer selection mode, the atom expression interface expects input at the monomer level. Clicking on an atom, for example, selects the entire monomer containing that atom. The monomer selection mode of the atom expression interface is shown below:

Alternatively, monomers can be selected from the text box, by dragging the mouse over the desired monomer id in the sequence:

Atom expressions can also be directly input from the keyboard or an SPL script. The following special characters are used to compose atom expressions using written commands:

The use of the above characters in the construction of atom expressions is illustrated in the following examples. Note that in many cases, there are multiple equivalent ways to construct atom expressions.

Select all atoms in a molecule (note that the molecule area need only be specified if more than one area is occupied):

Select all atoms in a given monomer unit (substructure):

Select a particular atom by its "atom name" (note that the substructure containing the atom must be included in the atom expression because atom names can be identical in different substructures):

Select all atoms lying within a 2.5 Angstrom radius sphere centered about atom "CA" of "PHE1" (note that this involves the use of the "SPHERE" dynamic set rule):

Select all "C.AR" type atoms in substructure "PHE1":

Select the bond defined by the dummy atoms named "CA" and "CB" of substructure "GLU2":