<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://teaching.healthtech.dtu.dk/22140/index.php?action=history&amp;feed=atom&amp;title=ExUniProt-answers</id>
	<title>ExUniProt-answers - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://teaching.healthtech.dtu.dk/22140/index.php?action=history&amp;feed=atom&amp;title=ExUniProt-answers"/>
	<link rel="alternate" type="text/html" href="https://teaching.healthtech.dtu.dk/22140/index.php?title=ExUniProt-answers&amp;action=history"/>
	<updated>2026-05-02T08:20:30Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://teaching.healthtech.dtu.dk/22140/index.php?title=ExUniProt-answers&amp;diff=16&amp;oldid=prev</id>
		<title>WikiSysop: Created page with &quot;== Answers to &quot;Exercise: Protein databases&quot; ==  The numbers are found using UniProt on Feb 10, 2017 (release 2017_01).  ===Simple text mining===  &#039;&#039;&#039;QUESTION 1:&#039;&#039;&#039; # How many hits do you find? &lt;br&gt;3150 # How many of these hits are from Swiss-Prot? &lt;br&gt;1254 # Can you identify the correct hit (&#039;&#039;i.e.&#039;&#039; see which one is actually human insulin and not something else)? &lt;br&gt;It&#039;s P01308 / INS_HUMAN (among the first ten hits).  &#039;&#039;&#039;QUESTION 2:&#039;&#039;&#039; How many hits are now left? How m...&quot;</title>
		<link rel="alternate" type="text/html" href="https://teaching.healthtech.dtu.dk/22140/index.php?title=ExUniProt-answers&amp;diff=16&amp;oldid=prev"/>
		<updated>2024-03-05T11:13:43Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Answers to &amp;quot;Exercise: Protein databases&amp;quot; ==  The numbers are found using UniProt on Feb 10, 2017 (release 2017_01).  ===Simple text mining===  &amp;#039;&amp;#039;&amp;#039;QUESTION 1:&amp;#039;&amp;#039;&amp;#039; # How many hits do you find? &amp;lt;br&amp;gt;3150 # How many of these hits are from Swiss-Prot? &amp;lt;br&amp;gt;1254 # Can you identify the correct hit (&amp;#039;&amp;#039;i.e.&amp;#039;&amp;#039; see which one is actually human insulin and not something else)? &amp;lt;br&amp;gt;It&amp;#039;s P01308 / INS_HUMAN (among the first ten hits).  &amp;#039;&amp;#039;&amp;#039;QUESTION 2:&amp;#039;&amp;#039;&amp;#039; How many hits are now left? How m...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Answers to &amp;quot;Exercise: Protein databases&amp;quot; ==&lt;br /&gt;
&lt;br /&gt;
The numbers were found using UniProt on Feb 10, 2017 (release 2017_01).&lt;br /&gt;
&lt;br /&gt;
===Simple text mining===&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 1:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
# How many hits do you find? &amp;lt;br&amp;gt;3150&lt;br /&gt;
# How many of these hits are from Swiss-Prot? &amp;lt;br&amp;gt;1254&lt;br /&gt;
# Can you identify the correct hit (&amp;#039;&amp;#039;i.e.&amp;#039;&amp;#039; see which one is actually human insulin and not something else)? &amp;lt;br&amp;gt;It&amp;#039;s P01308 / INS_HUMAN (among the first ten hits).&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 2:&amp;#039;&amp;#039;&amp;#039; How many hits are now left? How many of these are from Swiss-Prot? &amp;lt;br&amp;gt;1298 and 895&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 3:&amp;#039;&amp;#039;&amp;#039; How many hits are now left? How many of these are from Swiss-Prot? &amp;lt;br&amp;gt;195 and 60&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 4:&amp;#039;&amp;#039;&amp;#039; How many hits are now left? &amp;lt;br&amp;gt;100&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 5:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
# How did you do this? &amp;lt;br&amp;gt;By adding &amp;lt;tt&amp;gt;NOT name:receptor&amp;lt;/tt&amp;gt; to the query box.&lt;br /&gt;
# How many hits are now left? &amp;lt;br&amp;gt;48&lt;br /&gt;
&lt;br /&gt;
===The contents of UniProt===&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 6:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
# How many references are there in the insulin entry? &amp;lt;br&amp;gt;36&lt;br /&gt;
# Why do you think insulin is such a highly investigated protein? &amp;lt;br&amp;gt;Because it is linked to a common and serious disease (diabetes) and used as a drug.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 7:&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
# Where do you find insulin? &amp;lt;br&amp;gt;It is secreted from the cell (this is written just below the section heading). Under &amp;lt;u&amp;gt;GO - Cellular component&amp;lt;/u&amp;gt; you can find additional locations mentioned, such as &amp;lt;u&amp;gt;endoplasmic reticulum lumen&amp;lt;/u&amp;gt;, but these are temporary stages on the way to secretion.&lt;br /&gt;
# Why do you think it is found there? &amp;lt;br&amp;gt;Because it is a hormone - it has to travel through the bloodstream to influence other cells.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 8:&amp;#039;&amp;#039;&amp;#039; How long are the signal peptide and the propeptide, respectively? &amp;lt;br&amp;gt;24 and 31 amino acids.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 9:&amp;#039;&amp;#039;&amp;#039; Which positions are in β-sheet conformation in insulin?  &amp;lt;br&amp;gt;Positions 26-29, 48-50, 74-76, and 98-101.&lt;br /&gt;
&lt;br /&gt;
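The positions in QUESTION 9 are inclusive ranges, so a range from a to b covers b - a + 1 residues. A minimal Python sketch of that arithmetic for the four ranges listed there (the names strands and span_length are made up for this illustration and are not part of UniProt):&lt;br /&gt;

```python
# Inclusive residue ranges copied from the answer to QUESTION 9.
strands = [(26, 29), (48, 50), (74, 76), (98, 101)]


def span_length(start, end):
    # Both end positions count as residues, hence the +1.
    return end - start + 1


lengths = [span_length(s, e) for s, e in strands]
total = sum(lengths)
print(lengths, total)
```

Run as is, this reports four strands of 4, 3, 3 and 4 residues, 14 residues in total.&lt;br /&gt;
&lt;br /&gt;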
===Other databases linked from Swiss-Prot===&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;No questions asked here.&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
===Advanced search===&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 10:&amp;#039;&amp;#039;&amp;#039; How many proteins did you find, and what was the search string (the text in the search field)? &amp;lt;br&amp;gt;5,186,371&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;annotation:(type:signal)&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 11:&amp;#039;&amp;#039;&amp;#039; How many proteins do you find now, and what has the search string changed into? &amp;lt;br&amp;gt;3486 &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;annotation:(type:signal evidence:experimental)&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 12:&amp;#039;&amp;#039;&amp;#039; How many proteins do you find now, and what is the search string? &amp;lt;br&amp;gt;707 &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;annotation:(type:signal evidence:experimental) AND organism:&amp;quot;Homo sapiens (Human) [9606]&amp;quot;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 13 a:&amp;#039;&amp;#039;&amp;#039; How many proteins are there in UniProt from &amp;#039;&amp;#039;Neisseria gonorrhoeae&amp;#039;&amp;#039; with the default TaxID [485]? &amp;lt;br&amp;gt;9203&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 13 b:&amp;#039;&amp;#039;&amp;#039; How many proteins are there in UniProt from &amp;#039;&amp;#039;Neisseria gonorrhoeae&amp;#039;&amp;#039; in total (all strains and subspecies)? &amp;lt;br&amp;gt;18,596 (roughly twice as many)&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 13 c:&amp;#039;&amp;#039;&amp;#039; What does the search string look like now? &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;taxonomy:&amp;quot;Neisseria gonorrhoeae [485]&amp;quot;&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 14:&amp;#039;&amp;#039;&amp;#039; How many proteins of maximum length 10 do you find? &amp;lt;br&amp;gt;32,090 &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;length:[1 TO 10]&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 15:&amp;#039;&amp;#039;&amp;#039; How many proteins are now left? &amp;lt;br&amp;gt;1280 &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;length:[1 TO 10] existence:&amp;quot;evidence at protein level&amp;quot;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 16:&amp;#039;&amp;#039;&amp;#039; How many proteins are now left? &amp;lt;br&amp;gt;830  &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;length:[1 TO 10] existence:&amp;quot;evidence at protein level&amp;quot; fragment:no&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;QUESTION 17:&amp;#039;&amp;#039;&amp;#039; How many human non-fragment proteins of maximum length 10 do you find in UniProt? &amp;lt;br&amp;gt;5 &amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;length:[1 TO 10] existence:&amp;quot;evidence at protein level&amp;quot; fragment:no AND organism:&amp;quot;Human [9606]&amp;quot;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
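The search strings in QUESTION 14 to 17 grow by one filter per step. A minimal Python sketch of that build-up, assuming a made-up helper build_query (it only assembles the text of the search string and does not contact UniProt):&lt;br /&gt;

```python
# Hypothetical helper for illustration: joins filter terms with spaces, as in
# the answers to QUESTION 14 to 16, and appends the organism filter with AND,
# as in the answer to QUESTION 17.
def build_query(terms, organism=None):
    query = " ".join(terms)
    if organism is not None:
        query = query + ' AND organism:"' + organism + '"'
    return query


# Filters copied from the answers, in the order they were added.
steps = [
    "length:[1 TO 10]",
    'existence:"evidence at protein level"',
    "fragment:no",
]

q14 = build_query(steps[:1])
q15 = build_query(steps[:2])
q16 = build_query(steps[:3])
q17 = build_query(steps, organism="Human [9606]")

for q in (q14, q15, q16, q17):
    print(q)
```

Each printed line should match the search string given in the corresponding answer; the hit counts themselves can only be obtained by running the searches in UniProt.&lt;br /&gt;
&lt;br /&gt;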
&amp;#039;&amp;#039;&amp;#039;QUESTION 18:&amp;#039;&amp;#039;&amp;#039; Here they are in FASTA format:&lt;br /&gt;
 &amp;gt;sp|P01358|GAJU_HUMAN Gastric juice peptide 1 OS=Homo sapiens PE=1 SV=1&lt;br /&gt;
 LAAGKVEDSD&lt;br /&gt;
 &amp;gt;sp|P02728|GLEM_HUMAN Erythrocyte membrane glycopeptide OS=Homo sapiens PE=1 SV=1&lt;br /&gt;
 CEGHSHDHGA&lt;br /&gt;
 &amp;gt;sp|P02729|GLUR_HUMAN Urine glycopeptide OS=Homo sapiens PE=1 SV=1&lt;br /&gt;
 CEHSHDGA&lt;br /&gt;
 &amp;gt;sp|P22103|PNEU_HUMAN Pneumadin OS=Homo sapiens PE=1 SV=1&lt;br /&gt;
 AGEPKLDAGV&lt;br /&gt;
 &amp;gt;sp|P01858|TUFT_HUMAN Phagocytosis-stimulating peptide OS=Homo sapiens PE=1 SV=1&lt;br /&gt;
 TKPR&lt;/div&gt;</summary>
		<author><name>WikiSysop</name></author>
	</entry>
</feed>