<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://teaching.healthtech.dtu.dk/22113/index.php?action=history&amp;feed=atom&amp;title=Spider_toxins</id>
	<title>Spider toxins - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://teaching.healthtech.dtu.dk/22113/index.php?action=history&amp;feed=atom&amp;title=Spider_toxins"/>
	<link rel="alternate" type="text/html" href="https://teaching.healthtech.dtu.dk/22113/index.php?title=Spider_toxins&amp;action=history"/>
	<updated>2026-05-02T06:10:05Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://teaching.healthtech.dtu.dk/22113/index.php?title=Spider_toxins&amp;diff=70&amp;oldid=prev</id>
		<title>WikiSysop: Created page with &quot;__NOTOC__ === Description === Find all spider toxins in uniprot and output them in a fasta file. Who knows when it will be useful to produce venom?  === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database]. This will be your input file to your program.&lt;br&gt; Unpack it yourself with  gunzip uniprot_sprot.dat.gz or whatever method you prefer. Careful, it will take up 3 GB.&lt;br&gt; Notice there are many s...&quot;</title>
		<link rel="alternate" type="text/html" href="https://teaching.healthtech.dtu.dk/22113/index.php?title=Spider_toxins&amp;diff=70&amp;oldid=prev"/>
		<updated>2024-03-06T15:23:14Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;__NOTOC__ === Description === Find all spider toxins in uniprot and output them in a fasta file. Who knows when it will be useful to produce venom?  === Input/output === Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database]. This will be your input file to your program.&amp;lt;br&amp;gt; Unpack it yourself with  gunzip uniprot_sprot.dat.gz or whatever method you prefer. Careful, it will take up 3 GB.&amp;lt;br&amp;gt; Notice there are many s...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;__NOTOC__&lt;br /&gt;
=== Description ===&lt;br /&gt;
Find all spider toxins in uniprot and output them in a fasta file. Who knows when it will be useful to produce venom?&lt;br /&gt;
&lt;br /&gt;
=== Input/output ===&lt;br /&gt;
Download the entire [https://teaching.healthtech.dtu.dk/material/22113/uniprot_sprot.dat.gz swissprot database].&lt;br /&gt;
This will be your input file to your program.&amp;lt;br&amp;gt;&lt;br /&gt;
Unpack it yourself with&lt;br /&gt;
 gunzip uniprot_sprot.dat.gz&lt;br /&gt;
or whatever method you prefer. Careful, it will take up 3 GB.&amp;lt;br&amp;gt;&lt;br /&gt;
Notice there are many swissprot entries in the file and your program must handle that.&lt;br /&gt;
The output must be a fasta file, where the header is the swisprot ID of the entry and the protein sequence from the entry.&lt;br /&gt;
&lt;br /&gt;
=== How to pick the right entries/genes ===&lt;br /&gt;
Study the [https://teaching.healthtech.dtu.dk/material/22113/spidertoxins.txt example entry] carefully.&amp;lt;br&amp;gt;&lt;br /&gt;
First of all we are only interested in spider genes. The means that the OC line must contain &amp;quot;Araneae&amp;quot;.&lt;br /&gt;
Observe the CC lines in the entries. It can be seen that there are several sections and they start with &amp;quot;-!-&amp;quot; and capital letters describing the section. Pay attention to the following 3 sections: FUNCTION, TISSUE SPECIFICITY and SIMILARITY. From these sections you can derive that the protein is a toxin, like this:&lt;br /&gt;
&lt;br /&gt;
 Set a counter to 0.&lt;br /&gt;
 If a FUNCTION section exists and it contains the word &amp;quot;toxin&amp;quot;, increase the counter by 1.&lt;br /&gt;
 If a TISSUE SPECIFICITY section exists and it contains the words &amp;quot;venom gland&amp;quot;, increase the counter by 1.&lt;br /&gt;
 If a SIMILARITY section exists and it contains the word &amp;quot;toxin&amp;quot;, increase the counter by 1.&lt;br /&gt;
 If the counter is at least 2, then this is a spider toxin gene.&lt;/div&gt;</summary>
		<author><name>WikiSysop</name></author>
	</entry>
</feed>