User contributions for WikiSysop
19 November 2024
- 11:10, 19 November 2024 +1 Programme →Curious about the exam current
- 11:09, 19 November 2024 +2 m Programme No edit summary
- 11:08, 19 November 2024 +278 m Programme No edit summary
2 September 2024
- 18:04, 2 September 2024 0 m Course preparation →WSL/WSL2, Windows Subsystem for Linux current
- 18:03, 2 September 2024 −36 m Course preparation →WSL/WSL2, Windows Subsystem for Linux
- 18:02, 2 September 2024 +1 m Course preparation →WSL/WSL2, Windows Subsystem for Linux
- 18:02, 2 September 2024 −22 m Course preparation →WSL/WSL2, Windows Subsystem for Linux
15 May 2024
- 15:34, 15 May 2024 +69 m Resistance to antibiotics →Optimization current
13 May 2024
- 18:27, 13 May 2024 −7 Data analysis →Description current
- 18:26, 13 May 2024 +4 Data analysis →Description
- 18:25, 13 May 2024 +164 Data analysis →Description
- 17:02, 13 May 2024 +159 Data analysis →Description
- 16:56, 13 May 2024 +23 Data analysis →Description
- 16:50, 13 May 2024 +103 Data analysis →Description
- 10:04, 13 May 2024 +54 m Data analysis No edit summary
12 May 2024
- 23:18, 12 May 2024 +7 m Data analysis →Description
- 23:17, 12 May 2024 +569 m Data analysis →Description
22 April 2024
- 18:05, 22 April 2024 −2 Aligning expectations →The report itself current
- 18:05, 22 April 2024 +156 Aligning expectations →Report sections
- 17:58, 22 April 2024 +127 m Aligning expectations →The code
- 17:56, 22 April 2024 +29 Aligning expectations →The report itself
11 April 2024
- 08:00, 11 April 2024 0 m Scientific Libraries, Pandas, Numpy →Required course material for the lesson current
4 April 2024
- 15:02, 4 April 2024 +1 m Scientific Libraries, Pandas, Numpy No edit summary
- 14:45, 4 April 2024 −6 22113/22163 - Unix & Python Programming for Bioinformaticians →Resources current
- 14:42, 4 April 2024 +144 m Project list No edit summary current
20 March 2024
- 11:04, 20 March 2024 +3,660 Example code - Unit test No edit summary current
- 11:04, 20 March 2024 +106 Example code - Classes No edit summary current
13 March 2024
- 15:05, 13 March 2024 +21 Scientific Libraries, Statistics →Exercises to be handed in current
- 15:03, 13 March 2024 −171 m Aligning expectations No edit summary
- 15:02, 13 March 2024 +11,482 N Code construction Created page with "Learning how to construct good code is a main goal of the course. An important tool to facilitate this is peer evaluation. It is beneficial for you both to evaluate others, seeing other ways of solving the same problem - and to get feedback from other people on your own work. When evaluating you should check the teachers solutions to have some kind of reference apart from your own.<br> Tip: <span style="color:red">If you want your peer to pay specific attention to someth..." current
- 15:01, 13 March 2024 +129 22113/22163 - Unix & Python Programming for Bioinformaticians →Course details
6 March 2024
- 17:00, 6 March 2024 +2,596 N Shortest path in graph Created page with "__NOTOC__ === Description === The program is given as input a file containing connected nodes in a graph and a weight assigned to the edge between the nodes. The program shall answer the questions: Is there a path between two given nodes in the graph? If so, what is the shortest path ?<br> This is useful in a number of situations: Protein interaction, which proteins interact together, thereby discovering f.ex. new pathways. Social networks, who knows who, proving the "si..." current
- 16:24, 6 March 2024 +1,846 N Positive proteins Created page with "__NOTOC__ === Description === Find the top 1000 most positively charged protein sequences in uniprot and put them in a fasta file. Repeat the search but this time find the most positively charged protein sequences per molecular weight of the sequence and put that into another fasta file. Among the 20 common amino acids, five have a side chain which can be charged. At pH=7, two are negative charged: aspartic acid (D) and glutamic acid (E) (acidic side chains), and three a..." current
- 16:23, 6 March 2024 +1,614 N Spider toxins Created page with "__NOTOC__ === Description === Find all spider toxins in uniprot and output them in a fasta file. Who knows when it will be useful to produce venom? === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database]. This will be your input file to your program.<br> Unpack it yourself with gunzip uniprot_sprot.dat.gz or whatever method you prefer. Careful, it will take up 3 GB.<br> Notice there are many s..." current
- 16:22, 6 March 2024 +1,493 N Fun with biology - find english words Created page with "__NOTOC__ === Description === Parse the entire uniprot database and extract the ID and the sequences. Find English words that are hidden (actually occur randomly) in the sequences. The words must be between 3 and 10 letters long, both inclusive. Display or save in a file the ID together with the words found in the sequence, but only if the total number of letters is 5 or more for that entry. === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/ma..." current
- 16:21, 6 March 2024 +2,398 N Find short virus genes with disulfid bridges Created page with "__NOTOC__ === Description === Find all short (150 or less aa) virus genes in uniprot, that contain intrachain disulfid bridges. Interchain disulfide bonds can produce stable, covalently linked protein dimers, multimers or complexes, whereas intrachain disulfide bonds can contribute to protein folding and stability. === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database]. This will be your input..." current
- 16:19, 6 March 2024 +1,658 N Find the mature part of human genes with a signal peptide Created page with "__NOTOC__ === Description === Find all human genes in uniprot with a signal peptide. Extract the entire sequence and create a fasta file with only the mature proteins. === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database]. This will be your input file to your program.<br> Unpack it yourself with gunzip uniprot_sprot.dat.gz or whatever method you prefer. Careful, it will take up 3 GB.<br> Not..." current
- 16:18, 6 March 2024 +1,543 N Heuristic methods for fair sharing Created page with "__NOTOC__ === Description === Distributing jobs/items to a number of consumers in a fair way has a number of applications. In this project you must implement 5 methods mentioned in the [https://teaching.healthtech.dtu.dk/material/22112/HPCLife-LoadBalancing.ppt powerpoint]: Random Assignment, Round Robin, Max-Min Round Robin, Reverse Round Robin and Least Load. === Input/output === As can be seen from the powerpoint, there is some randomness in the input, i.e. the numb..." current
- 16:16, 6 March 2024 0 Score sequence data with a PSSM No edit summary current
- 16:15, 6 March 2024 +1 Human genes with activities in more than one region of the cell No edit summary current
- 16:13, 6 March 2024 −1 Human genes with activities in more than one region of the cell →Input/output
- 16:13, 6 March 2024 +1,844 N Human genes with activities in more than one region of the cell Created page with "__NOTOC__ === Description === Find human genes which are targeted to more than one region in the cell. Save the genes in fasta format. === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/36610/uniprot_sprot.dat.gz swissprot database]. This will be your input file to your program.<br> Unpack it yourself with gunzip uniprot_sprot.dat.gz or whatever method you prefer. Careful, it will take up 3 GB.<br> Notice there are many swissprot entr..."
- 16:10, 6 March 2024 +5,590 N Score sequence data with a PSSM Created page with "__NOTOC__ === Description === Position specific scoring matrices (PSSM) are statistically motivated sequence motif models that provide higher sensitivity and specificity than regular expressions. The project consists of reading a TRANSFAC matrix table, converting it to a log-likelihood matrix, which is used to find matching motifs in a DNA fasta file. Learn more about PSSM: https://en.wikipedia.org/wiki/Position_weight_matrix === Input and output === The program is giv..."
- 16:09, 6 March 2024 +4,332 N Artificial Neural Network Created page with "__NOTOC__ ===Description=== Implement a simple artificial neural network algorithm with backpropagation in Python. ANNs are of great interest in bioinformatics. The institute has created many online prediction servers, which utilises ANNs.<br> The data is a part of a project at DTU HealthTech, which is about prediction of whether certain variations of a SNP will lead to a disease or not. A lot of work has already gone into preparing a data set for network training. The r..." current
- 16:09, 6 March 2024 +1,916 N Pairwise alignment Created page with "__NOTOC__ ===Description=== Aligning sequences is of great importance in bioinformatics. Many discoveries are based on finding sequences that align to each other. Evolution theory and phylogeny are based on sequence alignments. This project is about implementing a well-known algorithm for aligning two sequences, i.e. finding where they match in an optimal fashion. You must choose to implement either: # Smith-Waterman alignment where the goal is to find the best local al..." current
- 16:08, 6 March 2024 +6,333 N QT clustering Created page with "__NOTOC__ ===Description=== The program reads a number of data points (multi-dimensional vectors) from a file and partitions those into clusters. Clustering is important in discovering patterns or modes in multi-dimensional data sets. It is also a method of organizing data examples into similar groups (clusters). In this particular case, QT clustering partitions the data set such that each example (data point) is assigned to exactly one cluster. QT clustering is superior..." current
- 16:07, 6 March 2024 +5,509 N Read trimmer for Next-Generation-Sequencing data Created page with "__NOTOC__ ===Description=== The advent of Next Generation Sequencing (NGS) technologies have transformed how biological research is being performed and today almost all biological fields use the technology for cutting edge discoveries. Today, a human genome can be sequenced in very short time for approximately $1000 giving unprecedented possibilities for investigating human traits, evolution and diseases. Similarly whole bacterial communities and their interplay with the..." current
- 16:07, 6 March 2024 +3,133 N K-nearest neighbor (k-NN) continuous variable estimation Created page with "__NOTOC__ ===Description=== This scripts read a matrix-styled data file, containing missing values, and infers these values by finding the k-nearest neighbors. An application of this can be seen in Microarray experiments, in which the observed signal is not always significantly different from the background signal. Imputing these values are a cheaper solution rather than redoing the whole experiment. This method has been shown to perform better than e.g. rowmeans, and fa..." current
- 16:05, 6 March 2024 0 N File:Absent.jpg No edit summary current
- 16:04, 6 March 2024 0 N File:Present.jpg No edit summary current