MMF allows the user to supply his/her own motifs as the input to the software. These external motifs can be of two different kinds considered at different stages of computation:
Motifs from the Jaspar database
JASPAR is a collection of transcription factor DNA-binding preferences, modeled as matrices. It consists of the following sub-databases:
More detailed information containing these subdatabases can be found at the Jaspar project website.
Reference: Sandelin A, Alkema W, Engstrom P, Wasserman WW, Lenhard B. JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D91-4.
Website: http://jaspar.genereg.net/
Acceptable formats of the motifs provided by user
| Format description | Examplary data describing motif |
Logo of the motif created
with the WebLogo tool |
|---|---|---|
| Position Frequency Matrix (PFM) |
0 5 0 1 0 2 0 4 5 1 0 0 1 4 1 0 6 0 0 0 |
|
| Matrix format used in Transfac database |
01 0 3 1 1 C 02 0 0 1 4 T 03 0 4 0 1 C 04 0 0 0 5 T 05 5 0 0 0 A 06 5 0 0 0 A |
|
| Matrix format from Transfac with header line and arbitrary data in first and last column | PO A C G T XXX1 1 3 4 1 XXX6 XXX2 0 0 1 8 XXX7 XXX3 1 7 0 1 XXX8 XXX4 3 0 0 6 XXX9 XXX5 9 0 0 0 XXX10 |
|
Format of the database provided by user
Format of the external database containing motifs has to be the same as the one used by file MATRIX_DATA.txt in Jaspar database. Each line must consist of four positions:<motif's identifier> <symbol> <position> <frequency>A good example can be found here.
Short example describing database containing two motifs:
MA0001 A 1 0.0000 MA0001 A 2 3.0000 MA0001 A 3 79.0000 MA0001 A 4 40.0000 MA0001 C 1 94.0000 MA0001 C 2 75.0000 MA0001 C 3 4.0000 MA0001 C 4 3.0000 MA0001 G 1 1.0000 MA0001 G 2 0.0000 MA0001 G 3 3.0000 MA0001 G 4 4.0000 MA0001 T 1 2.0000 MA0001 T 2 19.0000 MA0001 T 3 11.0000 MA0001 T 4 50.0000 MA0052 T 5 49.0000 MA0052 T 4 55.0000 MA0052 T 3 0.0000 MA0052 T 2 58.0000 MA0052 T 1 7.0000 MA0052 G 5 0.0000 MA0052 G 4 0.0000 MA0052 G 3 0.0000 MA0052 G 2 0.0000 MA0052 G 1 0.0000 MA0052 C 5 0.0000 MA0052 C 4 1.0000 MA0052 C 3 1.0000 MA0052 C 2 0.0000 MA0052 C 1 50.0000 MA0052 A 5 9.0000 MA0052 A 4 2.0000 MA0052 A 3 57.0000 MA0052 A 2 0.0000 MA0052 A 1 1.0000