summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorPaweł Hajdan <phajdan.jr@gentoo.org>2012-03-31 17:03:18 +0000
committerPaweł Hajdan <phajdan.jr@gentoo.org>2012-03-31 17:03:18 +0000
commit8ba2860ae7c5f5429e27ed1edd8753981376059a (patch)
tree65ab7f40697f559187fc6303fba1f00594969f7c /sci-biology/cd-hit
parentadd ~arm, bug #402989 (diff)
downloadgentoo-2-8ba2860ae7c5f5429e27ed1edd8753981376059a.tar.gz
gentoo-2-8ba2860ae7c5f5429e27ed1edd8753981376059a.tar.bz2
gentoo-2-8ba2860ae7c5f5429e27ed1edd8753981376059a.zip
Add upstream remote-id annotations using tools by Corentin Chary, bug #406287.
Diffstat (limited to 'sci-biology/cd-hit')
-rw-r--r--sci-biology/cd-hit/metadata.xml18
1 files changed, 9 insertions, 9 deletions
diff --git a/sci-biology/cd-hit/metadata.xml b/sci-biology/cd-hit/metadata.xml
index 0066dc245f77..e4184a444391 100644
--- a/sci-biology/cd-hit/metadata.xml
+++ b/sci-biology/cd-hit/metadata.xml
@@ -1,17 +1,15 @@
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE pkgmetadata SYSTEM "http://www.gentoo.org/dtd/metadata.dtd">
<pkgmetadata>
-<herd>sci-biology</herd>
-<maintainer>
- <email>jlec@gentoo.org</email>
-</maintainer>
-<longdescription>
-CD-HIT is a very widely used program for clustering and comparing large sets
+ <herd>sci-biology</herd>
+ <maintainer>
+ <email>jlec@gentoo.org</email>
+ </maintainer>
+ <longdescription>CD-HIT is a very widely used program for clustering and comparing large sets
of protein or nucleotide sequences. CD-HIT is very fast and can handle
extremely large databases. CD-HIT helps to significantly reduce the
computational and manual efforts in many sequence analysis tasks and aids in
understanding the data structure and correct the bias within a dataset.
-
The CD-HIT package has CD-HIT, CD-HIT-2D, CD-HIT-EST, CD-HIT-EST-2D,
CD-HIT-454, CD-HIT-PARA, PSI-CD-HIT and over a dozen scripts. CD-HIT
(CD-HIT-EST) clusters similar proteins (DNAs) into clusters that meet a
@@ -19,6 +17,8 @@ user-defined similarity threshold. CD-HIT-2D (CD-HIT-EST-2D) compares 2
datasets and identifies the sequences in db2 that are similar to db1 above
a threshold. CD-HIT-454 is a program to identify natural and artificial
duplicates from pyrosequencing reads. The usage of other programs and
-scripts can be found in CD-HIT user's guide.
-</longdescription>
+scripts can be found in CD-HIT user's guide.</longdescription>
+ <upstream>
+ <remote-id type="google-code">cdhit</remote-id>
+ </upstream>
</pkgmetadata>