about summary refs log tree commit diff
path: root/scripts/lmdb_matrix.py
AgeCommit message (Collapse)Author
2025-07-16Set a larger map size for the larger genotype files.Munyoki Kilyungi
Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Add functionality for importing one entire genotype directory.Munyoki Kilyungi
Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16More efficiently and correctly count nrows is genotype.Munyoki Kilyungi
Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Delete un-used import.Munyoki Kilyungi
Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Add module doc-string.Munyoki Kilyungi
Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Improve genotype file parsing.Munyoki Kilyungi
* scripts/lmdb_matrix.py (GenotypeMatrix): Delete transpose. Add "file_info" field. (save_excursion, count_lines): Delete. (get_genotype_dimensions, read_genotype_headers): New function. (read_geno_file): Rename this to... (read_genotype_file): ... this. (genotype_db_put): Put "file_info". (genotype_db_get): Fetch "file_info". (import_genotype): Use read_genotype_file. (print_current_matrix): Remove transpose. Print out file_info metadata. Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Simplify genotype db script.Munyoki Kilyungi
Hashing and versioning made the script very slow. Now, this script can handle much larger data. Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>
2025-07-16Add genotype matrix script.Munyoki Kilyungi
* scripts/lmdb_matrix.py: New file. Signed-off-by: Munyoki Kilyungi <me@bonfacemunyoki.com>