Choosing the unit via floor-log1024 generalizes better and is mathematically cleaner.
* dump.scm (floor-log1024): New function.
(human-units): Use floor-log1024.
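A minimal sketch of the idea, written in Python rather than the repository's Guile Scheme and not taken from dump.scm: the index of the unit is the floor of the base-1024 logarithm of the byte count. The function bodies, the unit labels and the output format below are assumptions for illustration only.

```python
# Assumed unit labels; the labels used in dump.scm are not shown in this log.
UNITS = ["B", "KiB", "MiB", "GiB", "TiB"]


def floor_log1024(n):
    """Return floor(log base 1024 of n) for a positive integer n."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    exponent = 0
    # Repeated integer division avoids floating-point rounding near powers of 1024.
    while n >= 1024:
        n //= 1024
        exponent += 1
    return exponent


def human_units(size):
    """Format a byte count using the largest unit that does not exceed it."""
    if size == 0:
        return "0 B"
    # Clamp to the last known unit so very large sizes still format.
    exponent = min(floor_log1024(size), len(UNITS) - 1)
    value = size / 1024 ** exponent
    return f"{value:.1f} {UNITS[exponent]}"
```

Deriving the unit from a single exponent means that supporting a larger unit only requires one more entry in the label list, rather than another branch in a chain of comparisons.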
|
|
Using sxml allows us to stay in the world of S-expressions.
* dump.scm (sxml->xml-string, sxml->graphviz-html): New functions.
(dump-schema): Construct graphviz HTML string using sxml.
|
|
* dump.scm (dump-schema): Highlight tables and columns.
|
|
* dump.scm (replace-substrings): New function.
(graph->dot): Fix HTML string handling.
|
|
* dump.scm (%dumped): New variable.
(define-dump): Append to %dumped when a new table dumping function is
defined.
|
|
* dump.scm (define-dump): New macro.
(dump-species, dump-strain, dump-mapping-method, dump-inbred-set,
dump-phenotype, dump-publication, dump-publish-xref, dump-tissue,
dump-investigators, dump-avg-method, dump-gene-chip, dump-info-files):
Redefine using define-dump.
|
|
* dump.scm (<column>): New type.
(tables): Use <column> objects to represent columns.
(trigrams, jaccard-index, jaccard-string-similarity): New functions.
(dump-schema): Use string similarity and check if foreign key is an
integer.
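The three new functions can be illustrated with a short sketch, written in Python rather than the repository's Guile Scheme and not taken from dump.scm. It shows the standard trigram-based Jaccard similarity between two strings. How dump-schema applies the score (for example, which names it compares and what threshold it uses) is not stated in this log, so none of that is modelled here, and the handling of empty inputs is an assumption.

```python
def trigrams(text):
    """Return the set of all three-character substrings of text."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def jaccard_index(a, b):
    """Return |a & b| / |a | b|; assumed to be 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def jaccard_string_similarity(s1, s2):
    """Return the Jaccard index of the trigram sets of two strings."""
    return jaccard_index(trigrams(s1), trigrams(s2))
```

Under this definition, identical strings of three or more characters score 1.0 and strings with no trigram in common score 0.0.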
|
|
* rdf.py: Delete file.
|
|
* .dir-locals.el (scheme-mode): Indent set-table-columns correctly.
* dump.scm: Import (srfi srfi-9 gnu).
(%database-name): New variable.
(<table>): New type.
(tables, string-remove-suffix-ci, human-units, graph->dot,
dump-schema): New functions.
Invoke dump-schema.
* guix.scm: Import (gnu packages bioinformatics). Add ccwl, graphviz
and guile-libyaml to the manifest.
|
|
* dump.scm (get-tables-from-comments, dump-table-fields, dump-species,
dump-strain, dump-mapping-method, dump-inbred-set, dump-phenotype,
dump-publication, dump-publish-xref, dump-tissue, dump-investigators,
dump-avg-method, dump-gene-chip, dump-info-files): Use select-query.
|
|
* dump/sql.scm: Import (srfi srfi-1). Export select-query.
(select-query): New macro.
|
|
* .dir-locals.el: New file.
|
|
* dump.scm (camel->lower-camel): Delete function.
(default-metadata-proc): Do not use camel->lower-camel.
|
|
* dump.scm (dump-mapping-method, dump-publication, dump-info-files):
Use string->identifier to build subjects.
|
|
Separating the prefix from the rest of the identifier with an underscore is slightly more readable.
* dump.scm (string->identifier): Append an underscore to the
identifier prefix.
|
|
* dump.scm (get-tables-from-comments, dump-table-fields): Fix
indentation.
|
|
* dump.scm: Import (dump utils).
(string-blank?): New function.
(scm->triples): Filter out triples with #f or blank string objects.
(process-metadata-alist): Delete function.
(default-metadata-proc): New function.
(dump-species, dump-strain, mapping-method-name->id, dump-inbred-set,
dump-phenotype, dump-publication, dump-publish-xref, dump-tissue,
dump-investigators, dump-avg-method, dump-gene-chip, dump-info-files):
Use map-alist.
|
|
map-alist is a DSL to transform one association list into
another. These transformations are frequently required when dumping
tables, especially metadata tables.
* dump/utils.scm: New file.
|
|
* dump.scm (investigator-email->id): Rename to
investigator-attributes->id. Use first and last names in addition to
the email ID.
(dump-investigators): Use investigator-attributes->id. Include records
that have no email ID.
(dump-info-files): Use investigator-attributes->id. Include records
that have no email ID.
|
|
For the simple task of removing spaces, string-delete is
sufficient. string-replace-substring is overkill.
* dump.scm (fix-email-id): Use string-delete instead of
string-replace-substring.
|
|
Building a turtle identifier from a string after removing illegal
characters and prefixing is an extremely common operation. Abstract
it. Also, mandate identifier prefixes. It is better to play it safe.
* dump.scm (string->identifier): New function.
(binomial-name->species-id, dump-strain, mapping-method-name->id,
inbred-set-name->id, aphenotype-id->id, tissue-short-name->id,
investigator-email->id, avg-method-name->id, gene-chip-name->id): Use
string->identifier.
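A minimal sketch of the abstraction described above, written in Python rather than the repository's Guile Scheme and not taken from dump.scm. The argument order, the set of characters treated as illegal, and the error raised for a missing prefix are all assumptions for illustration.

```python
import re


def string_to_identifier(prefix, text):
    """Build an identifier by removing disallowed characters and prefixing."""
    # The prefix is mandatory, mirroring the "mandate identifier prefixes" rule.
    if not prefix:
        raise ValueError("an identifier prefix is required")
    # Assumption: anything other than ASCII letters, digits and underscore
    # is treated as illegal and removed.
    cleaned = re.sub(r"[^A-Za-z0-9_]", "", text)
    return prefix + cleaned
```

Centralizing the clean-up in one function means every caller strips the same characters, instead of each name->id helper carrying its own variant.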
|
|
* dump.scm (delete-substring): Add docstring.
|
|
* dump.scm (dump-info-files): Deal with AvgMethodId.
|
|
Not all datasets have a non-NULL InfoPageTitle field.
* dump.scm (dump-info-files): Use InfoFileTitle instead of
InfoPageTitle for dataset name.
|
|
* dump.scm (dump-info-files): Extract name of dataset group.
|
|
Not all inbred sets have a mapping method, and the mapping method of
the inbred set has, so far, not been used anywhere.
* dump.scm (mapping-method-name->id, dump-mapping-method): Mark as
unused.
(dump-inbred-set): Do not link inbred-set to mapping-method.
|
|
* dump.scm (dump-avg-method): Allow N/A in name.
(dump-info-files): Allow N/A in avg-method-name.
(avg-method-name->id): Replace / with _.
|
|
* dump.scm (dump-info-files): Remove gn:geoSeries when value starts
with "No Geo Series...".
|
|
* dump.scm: In the gn: prefix, use http instead of https.
|
|
* guix.scm: Remove python-rdflib.
(python-berkeleydb, python-rdflib-6): Delete variables.
|
|
guix environment is deprecated.
* guix.scm: Replace guix environment with guix shell.
|
|
* dump.scm: Remove rdfs: prefix.
|
|
Parentheses became unbalanced due to my careless git use.
* dump.scm: Fix unbalanced parentheses.
|
|
* guix.scm: Do not use the guix: prefix when importing (gnu packages guile-xyz).
(guile-dbi, guile-dbi-bootstrap, guile-dbd-mysql): Delete variables.
|
|
* dump.scm (dump-info-files): New function.
[main]: Call dump-info-files.
|
|
* dump.scm (delete-substrings): New function.
(dump-publication): Replace string-replace-substring with
delete-substrings.
|
|
* guix.scm: Import (gnu packages dbm), (guix build-system python)
and (guix download).
(python-berkeleydb, python-rdflib-6): New variables.
[manifest]: Replace python-rdflib with python-rdflib-6. Remove
python-urllib3.
|
|
* guix.scm (guile-dbi): Update to 2.1.8.
(guile-dbi-bootstrap): Inherit from guile-dbi instead of
guix:guile-dbi.
Update comments about contributing upstream.
|
|
* dump.scm (dump-publication, dump-tissue): Do not define any property
to be a sub-property of rdfs:label.
|
|
* dump.scm (gene-chip-name->id, dump-gene-chip): New functions.
[main]: Call dump-gene-chip.
|
|
* dump.scm (avg-method-name->id, dump-avg-method): New functions.
[main]: Call dump-avg-method.
|
|
* dump.scm (camel->kebab): Delete function.
|
|
* dump.scm (fix-email-id, investigator-email->id, dump-investigators):
New functions.
Invoke dump-investigators.
|
|
* dump.scm (tissue-short-name->id, dump-tissue): New functions.
Invoke dump-tissue.
|
|
* dump.scm: Add foaf prefix.
|
|
* dump.scm (prefix): New function.
Use prefix.