Changing existing code base
| Previous: Runtime evaluation | Next: Last words |
Required course material for the lesson
GitHub: Professor Anders Gorm Pedersen's repositories. Used in teaching this week.
GitHub: Gene network. A much larger and thus more complicated/confusing repo. They are looking for help/contributers
GitHub: The Bioinformatics Repository. Different repos, various languages.
Subjects covered
Hands-on addition to a code base,
Exercises to be handed in
There are going to be only one exercise. You have the project, and the exercise will be big.
Clone Anders Gorm's repo: sequencelib and study it.
Make it able to read FASTQ files.
This includes changing the Seqfile class, adding a new Fastqfilehandle class, and understanding the Sequence class really well.
Of course, you are using Git to handle your changes. You might make some kind of limitation somewhere or maybe generator,
as FASTQ files get really big and can't all be in memory. Also, FASTQ are often gzipped - it is nice to read them directly from the gzip file.
You don't know FASTQ format? - look it up - this is supposed to be a "real world problem". You don't get much help.