Jack in the box XD


SUBMITTED BY: cramer

DATE: Oct. 1, 2015, 11:42 a.m.

All of the test files are actual genetic data (except the spell comparisons)
taken from GenBank. The fli*.txt files come from bacterial flagellar proteins,
and ftsa.txt is a cell division protein. The stx*.txt files contain the gene
for the toxin in the "Jack-in-the-Box" E. coli that caused illness and death
in the Western US in 1993 after it contaminated a batch of hamburger meat.
The sequences come from that E. coli as well as from the origin of the toxin,
the Shigella bacteria. The ecoli*.txt files contain the first N base pairs
(N is the number in the file name) from two types of E. coli: the laboratory
strain, E. coli K12, and the "Jack-in-the-Box" E. coli, E. coli O157:H7.

List of Files
-----------------------------------------------------------------------
Name: fli8.txt
Length: 10 and 8 bp
Protein: FliC (flagellar cap protein)
Species: E. coli K12 & Salmonella typhi
Notes: This file tests for insertion of gaps along with mismatches

Name: fli9.txt
Length: 9 and 7 bp
Protein: FliC (flagellar cap protein)
Species: E. coli K12 & Salmonella typhi
Notes: This file tests for consecutive gaps; very easy to recognize the best
match

Name: fli10.txt
Length: 10 and 10 bp
Protein: FliC (flagellar cap protein)
Species: E. coli K12 & Salmonella typhi
Notes: This file tests for mismatches only, with no gaps; easy to recognize
the best match

Name: stx19.txt
Length: 19 and 19 bp
Protein: StxA and StxB (Shiga toxin subunits A & B)
Species: Shigella dysenteriae & E. coli O157:H7 ("Jack-in-the-Box" E. coli)
Notes: This file tests for gaps on each side

Name: stx26.txt
Length: 21 and 26 bp
Protein: StxA and StxB (Shiga toxin subunits A & B)
Species: Shigella dysenteriae & E. coli O157:H7
Notes: Just a longer string that has gaps, consecutive gaps, and mismatches

Name: stx27.txt
Length: 21 and 27 bp
Protein: StxA and StxB (Shiga toxin subunits A & B)
Species: Shigella dysenteriae & E. coli O157:H7
Notes: Longer string with consecutive gaps

Name: gene57.txt
Length: 57 and 56 bp
Protein: ?
Species: ?
Notes: Sequence contained in the assignment writeup

Name: ftsa.txt
Length: 1263 and 1272 bp
Protein: FtsA (a cell division protein)
Species: E. coli K12 & Caulobacter crescentus
Notes:

Name: stx1230.txt
Length: 1213 and 1230 bp
Protein: StxA and StxB (Shiga toxin subunits A & B)
Species: Shigella dysenteriae & E. coli O157:H7
Notes: This is the gene from the "Jack-in-the-Box" E. coli epidemic in the
Western US in 1993, as found in both the E. coli itself and the original
host for the virulence genes

Name: ecoli5000.txt
Length: 5000 and 5000 bp
Proteins: First 5000 bp of genome
Species: E. coli K12 & E. coli O157:H7
Notes: The entire genomes of both bacteria are on the University of
Wisconsin website in text format. Each is 4.4 Mbp long.
ecoli2500.txt, ecoli3000.txt, ecoli7000.txt, ecoli8000.txt,
ecoli9000.txt, ecoli10000.txt, ecoli20000.txt, ecoli50000.txt,
ecoli100000.txt, ecoli500000.txt, ecoli1000000.txt are similar.
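
The notes above describe each file by the gaps and mismatches its best
alignment should contain. As a rough illustration of how such an alignment
can be scored, here is a minimal Python sketch of a global edit-distance
alignment. The function name alignment_cost, the penalties (gap = 2,
mismatch = 1), and the toy sequences are all assumptions made for this
example; none of them come from this file list or from the test files.

```python
def alignment_cost(x, y, gap=2, mismatch=1):
    """Minimum cost of globally aligning x and y (lower is better).

    The penalties are illustrative assumptions, not values from this README.
    """
    m, n = len(x), len(y)
    # prev[j] is the cost of aligning the part of x seen so far with y[:j].
    # Before any character of x is used, y[:j] can only be matched by j gaps.
    prev = [j * gap for j in range(n + 1)]
    for i in range(1, m + 1):
        # Aligning x[:i] with the empty string takes i gaps.
        curr = [i * gap] + [0] * n
        for j in range(1, n + 1):
            # Pair x[i-1] with y[j-1]: free on a match, else a mismatch.
            pair = prev[j - 1] + (0 if x[i - 1] == y[j - 1] else mismatch)
            # Or put a gap opposite x[i-1], or opposite y[j-1].
            curr[j] = min(pair, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[n]


# Toy sequences made up for illustration; they are not from the test files.
print(alignment_cost("ACGT", "ACGT"))  # identical strings
print(alignment_cost("ACGT", "AGGT"))  # one mismatch, no gaps
print(alignment_cost("ACGT", "ACT"))   # one gap
```

The sketch keeps only two rows of the table, so memory grows with the
length of one sequence, but the running time still grows with the product
of the two lengths. That is likely why the ecoli*.txt files come in a range
of sizes.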
