From 0a047686f250758c115ef3b62c83713191222155 Mon Sep 17 00:00:00 2001 From: Munyoki Kilyungi Date: Mon, 3 Apr 2023 22:24:28 +0300 Subject: Document how to check for duplicates in dump Signed-off-by: Munyoki Kilyungi --- issues/dump-genewiki-metadata.gmi | 8 ++++++++ 1 file changed, 8 insertions(+) (limited to 'issues') diff --git a/issues/dump-genewiki-metadata.gmi b/issues/dump-genewiki-metadata.gmi index 3aa5410..c9e2bf1 100644 --- a/issues/dump-genewiki-metadata.gmi +++ b/issues/dump-genewiki-metadata.gmi @@ -18,6 +18,14 @@ Dump the tables: => https://www.dublincore.org/specifications/dublin-core/dcmi-terms/# DCMI Metadata Terms => https://sparql.uniprot.org/.well-known/sparql-examples/ UNIPROT sparql examples +## Checking for duplicates + +``` +ag "Observational study of gene-disease association" dump.ttl --pager='less -R' +ag "gn:symbol" | sort | less +ag "gn:anonSymbol" | sort | less +``` + ## Issues * Some entries in the GeneRIF table don't have any entries in the GeneRIF_BASIC table: -- cgit v1.2.3