quality control of delimited files
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
jgart 29cf933191 update todos 1 month ago
src update readme 1 month ago
t Fix failing tests 1 month ago
.gitignore Ignore emacs temporary files and compiled lisp files 1 month ago
README.md update todos 1 month ago
UNLICENSE add unlicense 1 month ago
manifest.scm update manifest 3 months ago
qc.asd upate asd file 3 months ago
run-tests.lisp Add missing "run-tests.lisp" file 1 month ago
strains.csv add strains file 2 months ago

README.md

qc

qc CI badge

project goals

This project started as a collaboration with Arthur Centeno to check tsv files for the following critera:

  • no empty data cells
  • no data cells with spurious characters like eeeee, 5.555iloveguix, etc...
  • decimal numbers must conform to the following criteria:
    • when checking an average file decimal numbers must contain exactly three places to the right side of the dot.
    • when checking a standard error file decimal numbers must contain six or greater places to the right side of the dot.
    • there must be a number to the left side of the dot (e.g. 0.55555 is allowed but .55555 is not).
  • check line endings to make sure they are Unix and not DOS
  • check strain headers against a source of truth (see strains.csv)

todos

  • do not hard code path when loading strain files
  • full test coverage

development with guix

guix shell --container --manifest=manifest.scm

running tests

sbcl --load run-tests.lisp