Cross-referencing of information
about Mycoplasma pneumoniae and genitalium genes
There are many different sources of information
concerning the genes of M. pneumoniae and M. genitalium. These
sometimes involve different naming systems for the genes and different sources
of functional annotation that do not always agree. We are attempting to collect
much of this information in a spreadsheet. The actual entries for each column
(A - AE) are given here for one gene as an example:
- A) MPN001 --The name of the
M. pneumoniae gene according to the new naming system used by Bork and
GenBank
- B) 153 -- The sequential number
of the M. pneumoniae gene in the coordinate of the original published
sequence (called the ZMBH #)
- C) K05_orf380 -- The original
M. pneumoniae gene names from Hermann's lab based on the name
of the cosmid containing the gene and its size
- D) g1673814 -- The protein identifier
(PID) for the M. pneumoniae gene product
- E) MG001 -- The name of the homologous
M. genitalium gene, if any (in this case the homolog is also the
ortholog, but that is not always so)
- F) MG001 -- The name of the orthologous
M. genitalium gene if there is one
- G) 735..1829 -- The coordinates
of the M. genitalium gene on the genome sequence
- H) + -- The orientation of the
gene on the M. genitalium genome
- I) 365 -- The length of the coding
region in codons
- J) 12044851 -- The protein identifier
(PID) of the M. genitalium ortholog
- K)
MPN001blast
-- Link to a blast search against the genes in the NCBI COG database
- L) dnaN -- The conventional gene
name where one exists
- M) dna metabolism -- The TIGR
role category assigned to the M. genitalium gene
- N) L -- The one letter functional
group assignment of the NCBI COG page (Eugene Koonin)
- O) L -- Membership
in the indicated functional category of the minimal gene set of Mushegian
and Koonin
- P) no entry
indicates none observed -- Disruptive
transposon insertions observed within the orthologous genes in M. pneumoniae
and M. genitalium by
Hutchison et al. (1999)
- Q) 1 -- Indicates
membership in a universally conserved gene family according to the NCBI COG
database
- R)
context
-- Link to NCBI maps of the region containing the gene for all COG members
- S)
COG0592
-- The COG (Cluster of Orthologous Groups)
to which the gene belongs according to Koonin; linked to a display of
all COG members from NCBI
- T) This column is hidden behind P and
contains the COG numbers but without the NCBI link
- U) DNA polymerase III beta subunit
-- The predicted function from the Koonin COG system
- V) DNA polymerase III, subunit beta
-- The common name of the gene product from the TIGR CMR
- W) DNA polymerase III, subunit beta
(dnaN) -- The gene product name from the NCBI gene list for M. genitalium
(not always the same as the COG prediction)
- X) DNA polymerase III beta subunit
(dnaN); STAAU -- The original M. pneumoniae gene function assignments
from the Hermann lab
- Y) 12044851 1 -- Similarity to
a protein of known 3-D structure, if one exists (from NCBI pages):
the PID of the M. genitalium protein followed by the number of
sequences in the PDB structural database with significant similarity
(in this case 1)
- Z) no entry indicates not listed
-- Indicates the genes included in a list of candidates by Grigoriev (an
entry of "c" would indicate a listed gene product)
- AA) no entry -- An entry here
would name a homologous gene on a list from Rosie Kim, selected for structure
determination (for example MJ1228 would refer to a homologous gene in
M. jannaschii)
- AB) no entry -- An entry of "ortholog"
indicates the homolog selected for structure determination is likely
to be an ortholog of the mycoplasma gene, for example because it is in the
same COG. An entry of "homolog" indicates that the homolog selected for
structure determination is unlikely to be an ortholog of the mycoplasma gene.
- AC) no entry -- Status of the
structure determination for these candidates (sample entries: a) solved, b)
Xtal, no diff.)
- AD) no entry -- Biochemical function
deduced from solved structures (sample entry: methyl transferase)
- AE) no entry --
PDB identifier of structure solved by BSGC
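As a rough illustration of how the example entries above relate to one another, the following Python sketch stores a few of them in a dictionary and checks that the coordinates in column G agree with the codon count in column I. It is only a sketch under stated assumptions: the dictionary keys are the column letters rather than real spreadsheet headers, the function names are hypothetical, and the coordinates are assumed to be inclusive, so that 735..1829 spans 1095 nucleotides, or 365 codons. This holds for the example gene but has not been checked against the rest of the spreadsheet.

```python
# Example values copied from the column descriptions above.
# Keys are column letters, used here only for illustration.
example_row = {
    "A": "MPN001",
    "B": "153",
    "C": "K05_orf380",
    "E": "MG001",
    "F": "MG001",
    "G": "735..1829",
    "H": "+",
    "I": "365",
    "J": "12044851",
    "L": "dnaN",
    "S": "COG0592",
    "Y": "12044851 1",
}


def parse_coordinates(text):
    """Split a 'start..end' string (column G) into two integers."""
    start, end = text.split("..")
    return int(start), int(end)


def codons_from_coordinates(text):
    """Number of codons in a range, assuming inclusive coordinates."""
    start, end = parse_coordinates(text)
    length_nt = abs(end - start) + 1
    if length_nt % 3 != 0:
        raise ValueError(f"{text} is not a whole number of codons")
    return length_nt // 3


def parse_structure_hit(text):
    """Split column Y into (PID, number of similar PDB sequences).

    Returns None for an empty cell, i.e. no structural similarity listed.
    """
    if not text.strip():
        return None
    pid, count = text.split()
    return pid, int(count)


# Columns G and I should describe the same coding region.
coords_ok = codons_from_coordinates(example_row["G"]) == int(example_row["I"])
print(coords_ok)
print(parse_structure_hit(example_row["Y"]))
```

A check of this kind could be run over every row of an exported copy of the spreadsheet to flag entries where the coordinates and the length disagree.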
Mycoplasma gene cross-reference
-- The information is presented as an Excel spreadsheet. Activating this
link will either display the spreadsheet or will allow you to save the file
so it can be displayed with Microsoft Excel (the exact behavior varies with
the browser used and the operating system of your computer). This spreadsheet
is under construction. Please use at your own risk, and report any errors,
problems, or suggested additions and improvements to
Clyde Hutchison (clyde@email.unc.edu).
updated 4/8/02