Case study input data

Dataset for 20 strains of E. coli with manually curated annotations available in the ColiScope database (Marie Touchon et al., PLoS Genetics, 2009), formatted as input for eCAMBer. Annotations from PATRIC and Coliscope are provided in the zip bundle.

Datasets from our previous work formatted as input for eCAMBer:

  • Input dataset for 2 strains of S. aureus.
  • Input dataset for 9 strains of M. tuberculosis.
  • Input dataset for 22 strains of S. aureus.
  • Input dataset for 41 strains of E. coli.

CAMBerVis

  • Two example datasets for M. tubercuolisis and S. aureus (with plasmids) are integrated with the software.
Details on the input datasets from our previous work
  • Details on the input data used in S. aureus and E. coli case studies can be found in the provided GenBank files.
  • Details on the input data for the M. tuberculosis case study:
    • H37Rv - genome sequence and annotation from NCBI database,
      (NCBI ID: NC_000962.2, last change: Jan 5, 2005, download: Jul 8, 2010)
    • H37Ra - genome sequence and annotation from NCBI database,
      (NCBI ID: NC_009525.1, last change: Mar 12, 2007, download: Jul 8, 2010)
    • F11 - genome sequence and annotation from NCBI database,
      (NCBI ID: NC_009565.1, last change: Jun 14, 2007, download: Jul 8, 2010)
    • KZN 4207(B) - genome sequence and annotation from the Broad Institute database,
      (download: Jul 8, 2010)
    • KZN 4207(T) - genome sequence and annotation from supplementary material of the publication
    • KZN 1435 - genome sequence and annotation from the Broad Institute database,
      (download: Jul 8, 2010)
    • KZN V2475 - genome sequence and annotation from supplementary material of the publication
    • KZN R506 - genome sequence and annotation from supplementary material of the publication
    • KZN 605 - genome sequence and annotation from the Broad Institute database,
      (download: Jul 8, 2010)