Command Line Interface¶

usage: cubi-tk [-h] [--verbose] [--version] [--config CONFIG]
               [--sodar-server-url SODAR_SERVER_URL]
               [--sodar-api-token SODAR_API_TOKEN]
               {isa-tpl,isa-tab,snappy,sodar,irods,org-raw,sea-snap,archive}
               ...

Positional Arguments¶

cmd

Possible choices: isa-tpl, isa-tab, snappy, sodar, irods, org-raw, sea-snap, archive

Named Arguments¶

--verbose

Increase verbosity.

Default: False

--version

show program’s version number and exit

Basic Configuration¶

`--config`	Path to configuration file.
`--sodar-server-url`
	SODAR server URL key to use, defaults to env SODAR_SERVER_URL.
`--sodar-api-token`
	SODAR API token to use, defaults to env SODAR_API_TOKEN.

Sub-commands¶

isa-tpl¶

Create of ISA-tab directories from predefined templates.

cubi-tk isa-tpl [-h]
                {single_cell_rnaseq,bulk_rnaseq,tumor_normal_dna,tumor_normal_triplets,germline,generic,microarray,ms_meta_biocrates,stem_cell_core_bulk,stem_cell_core_sc}
                ...

Positional Arguments¶

tpl

Possible choices: single_cell_rnaseq, bulk_rnaseq, tumor_normal_dna, tumor_normal_triplets, germline, generic, microarray, ms_meta_biocrates, stem_cell_core_bulk, stem_cell_core_sc

Sub-commands¶

single_cell_rnaseq¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl single_cell_rnaseq [-h]
                                   [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                   [--var-sample-names VAR_SAMPLE_NAMES]
                                   [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                                   [--var-lib-kit VAR_LIB_KIT]
                                   [--var-source-meta VAR_SOURCE_META]
                                   [--var-sample-meta VAR_SAMPLE_META]
                                   [--var-dissociation-meta VAR_DISSOCIATION_META]
                                   [--var-library-construction-meta VAR_LIBRARY_CONSTRUCTION_META]
                                   [--var-sequencing-meta VAR_SEQUENCING_META]
                                   [--var--library-types VAR__LIBRARY_TYPES]
                                   [--var-library-type VAR_LIBRARY_TYPE]
                                   [--var-sample-multiplexing VAR_SAMPLE_MULTIPLEXING]
                                   [--var-genotype-multiplexing VAR_GENOTYPE_MULTIPLEXING]
                                   [--var-study-title VAR_STUDY_TITLE]
                                   [--var-s-file-name VAR_S_FILE_NAME]
                                   [--var-assay-prefix VAR_ASSAY_PREFIX]
                                   [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                                   [--var-assay-name VAR_ASSAY_NAME]
                                   [--var---output-dir VAR___OUTPUT_DIR]
                                   output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-source-meta`
	template variables ‘source_meta’
`--var-sample-meta`
	template variables ‘sample_meta’
`--var-dissociation-meta`
	template variables ‘dissociation_meta’
`--var-library-construction-meta`
	template variables ‘library_construction_meta’
`--var-sequencing-meta`
	template variables ‘sequencing_meta’
`--var--library-types`
	template variables ‘_library_types’
`--var-library-type`
	template variables ‘library_type’
`--var-sample-multiplexing`
	template variables ‘sample_multiplexing’
`--var-genotype-multiplexing`
	template variables ‘genotype_multiplexing’
`--var-study-title`
	template variables ‘study_title’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-assay-name`
	template variables ‘assay_name’
`--var---output-dir`
	template variables ‘__output_dir’

bulk_rnaseq¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl bulk_rnaseq [-h]
                            [--var-investigation-title VAR_INVESTIGATION_TITLE]
                            [--var-sample-names VAR_SAMPLE_NAMES]
                            [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                            [--var-a-measurement-types VAR_A_MEASUREMENT_TYPES]
                            [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                            [--var-a-technology-types VAR_A_TECHNOLOGY_TYPES]
                            [--var-lib-kit VAR_LIB_KIT]
                            [--var-organism VAR_ORGANISM]
                            [--var-batch VAR_BATCH]
                            [--var-lib-kits VAR_LIB_KITS]
                            [--var-organisms VAR_ORGANISMS]
                            [--var-instrument VAR_INSTRUMENT]
                            [--var-center-name VAR_CENTER_NAME]
                            [--var-center-contact VAR_CENTER_CONTACT]
                            [--var-study-title VAR_STUDY_TITLE]
                            [--var-s-file-name VAR_S_FILE_NAME]
                            [--var-assay-prefix VAR_ASSAY_PREFIX]
                            [--var-a-measurement-abbreviation VAR_A_MEASUREMENT_ABBREVIATION]
                            [--var-assay-name VAR_ASSAY_NAME]
                            [--var-sample-type VAR_SAMPLE_TYPE]
                            [--var-lib-strategy VAR_LIB_STRATEGY]
                            [--var-lib-selection VAR_LIB_SELECTION]
                            [--var-lib-layout VAR_LIB_LAYOUT]
                            [--var---output-dir VAR___OUTPUT_DIR]
                            output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-a-measurement-types`
	template variables ‘a_measurement_types’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-a-technology-types`
	template variables ‘a_technology_types’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-organism`	template variables ‘organism’
`--var-batch`	template variables ‘batch’
`--var-lib-kits`	template variables ‘lib_kits’
`--var-organisms`
	template variables ‘organisms’
`--var-instrument`
	template variables ‘instrument’
`--var-center-name`
	template variables ‘center_name’
`--var-center-contact`
	template variables ‘center_contact’
`--var-study-title`
	template variables ‘study_title’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-measurement-abbreviation`
	template variables ‘a_measurement_abbreviation’
`--var-assay-name`
	template variables ‘assay_name’
`--var-sample-type`
	template variables ‘sample_type’
`--var-lib-strategy`
	template variables ‘lib_strategy’
`--var-lib-selection`
	template variables ‘lib_selection’
`--var-lib-layout`
	template variables ‘lib_layout’
`--var---output-dir`
	template variables ‘__output_dir’

tumor_normal_dna¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl tumor_normal_dna [-h]
                                 [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                 [--var-sample-names VAR_SAMPLE_NAMES]
                                 [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                                 [--var-lib-kit VAR_LIB_KIT]
                                 [--var-lib-kits VAR_LIB_KITS]
                                 [--var-instrument VAR_INSTRUMENT]
                                 [--var-center-name VAR_CENTER_NAME]
                                 [--var-center-contact VAR_CENTER_CONTACT]
                                 [--var-study-title VAR_STUDY_TITLE]
                                 [--var-is-triplet VAR_IS_TRIPLET]
                                 [--var-s-file-name VAR_S_FILE_NAME]
                                 [--var-assay-prefix VAR_ASSAY_PREFIX]
                                 [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                                 [--var-a-measurement-abbreviation VAR_A_MEASUREMENT_ABBREVIATION]
                                 [--var-assay-name VAR_ASSAY_NAME]
                                 [--var-sample-type VAR_SAMPLE_TYPE]
                                 [--var-lib-strategy VAR_LIB_STRATEGY]
                                 [--var-lib-selection VAR_LIB_SELECTION]
                                 [--var-lib-layout VAR_LIB_LAYOUT]
                                 [--var---output-dir VAR___OUTPUT_DIR]
                                 output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-lib-kits`	template variables ‘lib_kits’
`--var-instrument`
	template variables ‘instrument’
`--var-center-name`
	template variables ‘center_name’
`--var-center-contact`
	template variables ‘center_contact’
`--var-study-title`
	template variables ‘study_title’
`--var-is-triplet`
	template variables ‘is_triplet’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-a-measurement-abbreviation`
	template variables ‘a_measurement_abbreviation’
`--var-assay-name`
	template variables ‘assay_name’
`--var-sample-type`
	template variables ‘sample_type’
`--var-lib-strategy`
	template variables ‘lib_strategy’
`--var-lib-selection`
	template variables ‘lib_selection’
`--var-lib-layout`
	template variables ‘lib_layout’
`--var---output-dir`
	template variables ‘__output_dir’

tumor_normal_triplets¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl tumor_normal_triplets [-h]
                                      [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                      [--var-sample-names VAR_SAMPLE_NAMES]
                                      [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                                      [--var-lib-kit VAR_LIB_KIT]
                                      [--var-lib-kits VAR_LIB_KITS]
                                      [--var-instrument VAR_INSTRUMENT]
                                      [--var-center-name VAR_CENTER_NAME]
                                      [--var-center-contact VAR_CENTER_CONTACT]
                                      [--var-study-title VAR_STUDY_TITLE]
                                      [--var-is-triplet VAR_IS_TRIPLET]
                                      [--var-s-file-name VAR_S_FILE_NAME]
                                      [--var-assay-prefix VAR_ASSAY_PREFIX]
                                      [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                                      [--var-a-measurement-abbreviation VAR_A_MEASUREMENT_ABBREVIATION]
                                      [--var-assay-name VAR_ASSAY_NAME]
                                      [--var-sample-type VAR_SAMPLE_TYPE]
                                      [--var-lib-strategy VAR_LIB_STRATEGY]
                                      [--var-lib-selection VAR_LIB_SELECTION]
                                      [--var-lib-layout VAR_LIB_LAYOUT]
                                      [--var---output-dir VAR___OUTPUT_DIR]
                                      output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-lib-kits`	template variables ‘lib_kits’
`--var-instrument`
	template variables ‘instrument’
`--var-center-name`
	template variables ‘center_name’
`--var-center-contact`
	template variables ‘center_contact’
`--var-study-title`
	template variables ‘study_title’
`--var-is-triplet`
	template variables ‘is_triplet’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-a-measurement-abbreviation`
	template variables ‘a_measurement_abbreviation’
`--var-assay-name`
	template variables ‘assay_name’
`--var-sample-type`
	template variables ‘sample_type’
`--var-lib-strategy`
	template variables ‘lib_strategy’
`--var-lib-selection`
	template variables ‘lib_selection’
`--var-lib-layout`
	template variables ‘lib_layout’
`--var---output-dir`
	template variables ‘__output_dir’

germline¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl germline [-h]
                         [--var-investigation-title VAR_INVESTIGATION_TITLE]
                         [--var-sample-names VAR_SAMPLE_NAMES]
                         [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                         [--var-lib-kit VAR_LIB_KIT] [--var-batch VAR_BATCH]
                         [--var-lib-kits VAR_LIB_KITS]
                         [--var-instrument VAR_INSTRUMENT]
                         [--var-center-name VAR_CENTER_NAME]
                         [--var-center-contact VAR_CENTER_CONTACT]
                         [--var-study-title VAR_STUDY_TITLE]
                         [--var-s-file-name VAR_S_FILE_NAME]
                         [--var-assay-prefix VAR_ASSAY_PREFIX]
                         [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                         [--var-a-measurement-abbreviation VAR_A_MEASUREMENT_ABBREVIATION]
                         [--var-assay-name VAR_ASSAY_NAME]
                         [--var-sample-type VAR_SAMPLE_TYPE]
                         [--var-lib-strategy VAR_LIB_STRATEGY]
                         [--var-lib-selection VAR_LIB_SELECTION]
                         [--var-lib-layout VAR_LIB_LAYOUT]
                         [--var---output-dir VAR___OUTPUT_DIR]
                         output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-batch`	template variables ‘batch’
`--var-lib-kits`	template variables ‘lib_kits’
`--var-instrument`
	template variables ‘instrument’
`--var-center-name`
	template variables ‘center_name’
`--var-center-contact`
	template variables ‘center_contact’
`--var-study-title`
	template variables ‘study_title’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-a-measurement-abbreviation`
	template variables ‘a_measurement_abbreviation’
`--var-assay-name`
	template variables ‘assay_name’
`--var-sample-type`
	template variables ‘sample_type’
`--var-lib-strategy`
	template variables ‘lib_strategy’
`--var-lib-selection`
	template variables ‘lib_selection’
`--var-lib-layout`
	template variables ‘lib_layout’
`--var---output-dir`
	template variables ‘__output_dir’

generic¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl generic [-h]
                        [--var-investigation-title VAR_INVESTIGATION_TITLE]
                        [--var-sample-names VAR_SAMPLE_NAMES]
                        [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                        [--var-a-measurement-types VAR_A_MEASUREMENT_TYPES]
                        [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                        [--var-a-technology-types VAR_A_TECHNOLOGY_TYPES]
                        [--var-lib-kit VAR_LIB_KIT]
                        [--var-organism VAR_ORGANISM] [--var-batch VAR_BATCH]
                        [--var-lib-kits VAR_LIB_KITS]
                        [--var-organisms VAR_ORGANISMS]
                        [--var-instrument VAR_INSTRUMENT]
                        [--var-center-name VAR_CENTER_NAME]
                        [--var-center-contact VAR_CENTER_CONTACT]
                        [--var-study-title VAR_STUDY_TITLE]
                        [--var-s-file-name VAR_S_FILE_NAME]
                        [--var-assay-prefix VAR_ASSAY_PREFIX]
                        [--var-a-measurement-abbreviation VAR_A_MEASUREMENT_ABBREVIATION]
                        [--var-assay-name VAR_ASSAY_NAME]
                        [--var-sample-type VAR_SAMPLE_TYPE]
                        [--var-lib-strategy VAR_LIB_STRATEGY]
                        [--var-lib-selection VAR_LIB_SELECTION]
                        [--var-lib-layout VAR_LIB_LAYOUT]
                        [--var---output-dir VAR___OUTPUT_DIR]
                        output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-a-measurement-types`
	template variables ‘a_measurement_types’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-a-technology-types`
	template variables ‘a_technology_types’
`--var-lib-kit`	template variables ‘lib_kit’
`--var-organism`	template variables ‘organism’
`--var-batch`	template variables ‘batch’
`--var-lib-kits`	template variables ‘lib_kits’
`--var-organisms`
	template variables ‘organisms’
`--var-instrument`
	template variables ‘instrument’
`--var-center-name`
	template variables ‘center_name’
`--var-center-contact`
	template variables ‘center_contact’
`--var-study-title`
	template variables ‘study_title’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-measurement-abbreviation`
	template variables ‘a_measurement_abbreviation’
`--var-assay-name`
	template variables ‘assay_name’
`--var-sample-type`
	template variables ‘sample_type’
`--var-lib-strategy`
	template variables ‘lib_strategy’
`--var-lib-selection`
	template variables ‘lib_selection’
`--var-lib-layout`
	template variables ‘lib_layout’
`--var---output-dir`
	template variables ‘__output_dir’

microarray¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl microarray [-h]
                           [--var-investigation-title VAR_INVESTIGATION_TITLE]
                           [--var-sample-names VAR_SAMPLE_NAMES]
                           [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                           [--var-organism VAR_ORGANISM]
                           [--var-organisms VAR_ORGANISMS]
                           [--var-technology-platform VAR_TECHNOLOGY_PLATFORM]
                           [--var-array-design-ref VAR_ARRAY_DESIGN_REF]
                           [--var-study-title VAR_STUDY_TITLE]
                           [--var-s-file-name VAR_S_FILE_NAME]
                           [--var-assay-prefix VAR_ASSAY_PREFIX]
                           [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                           [--var-assay-name VAR_ASSAY_NAME]
                           [--var-terms VAR_TERMS]
                           [--var---output-dir VAR___OUTPUT_DIR]
                           output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-organism`	template variables ‘organism’
`--var-organisms`
	template variables ‘organisms’
`--var-technology-platform`
	template variables ‘technology_platform’
`--var-array-design-ref`
	template variables ‘array_design_ref’
`--var-study-title`
	template variables ‘study_title’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-assay-name`
	template variables ‘assay_name’
`--var-terms`	template variables ‘terms’
`--var---output-dir`
	template variables ‘__output_dir’

ms_meta_biocrates¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl ms_meta_biocrates [-h]
                                  [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                  [--var-study-title VAR_STUDY_TITLE]
                                  [--var-study-id VAR_STUDY_ID]
                                  [--var-study-file-name VAR_STUDY_FILE_NAME]
                                  [--var-sample-names VAR_SAMPLE_NAMES]
                                  [--var-organism VAR_ORGANISM]
                                  [--var-organisms VAR_ORGANISMS]
                                  [--var-assay-measurement-type VAR_ASSAY_MEASUREMENT_TYPE]
                                  [--var-assay-technology-type VAR_ASSAY_TECHNOLOGY_TYPE]
                                  [--var-assay-technology-types VAR_ASSAY_TECHNOLOGY_TYPES]
                                  [--var-biocrates-kit VAR_BIOCRATES_KIT]
                                  [--var-assay-prefix VAR_ASSAY_PREFIX]
                                  [--var-assay-name VAR_ASSAY_NAME]
                                  [--var-assay-measurement-abbreviation-LC VAR_ASSAY_MEASUREMENT_ABBREVIATION_LC]
                                  [--var-assay-measurement-abbreviation-FIA VAR_ASSAY_MEASUREMENT_ABBREVIATION_FIA]
                                  [--var-biocrates-metidq-version VAR_BIOCRATES_METIDQ_VERSION]
                                  [--var-metaquac-version VAR_METAQUAC_VERSION]
                                  [--var-instrument VAR_INSTRUMENT]
                                  [--var-instruments VAR_INSTRUMENTS]
                                  [--var-chromatography-instrument VAR_CHROMATOGRAPHY_INSTRUMENT]
                                  [--var---output-dir VAR___OUTPUT_DIR]
                                  output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-investigation-title`
	template variables ‘investigation_title’
`--var-study-title`
	template variables ‘study_title’
`--var-study-id`	template variables ‘study_id’
`--var-study-file-name`
	template variables ‘study_file_name’
`--var-sample-names`
	template variables ‘sample_names’
`--var-organism`	template variables ‘organism’
`--var-organisms`
	template variables ‘organisms’
`--var-assay-measurement-type`
	template variables ‘assay_measurement_type’
`--var-assay-technology-type`
	template variables ‘assay_technology_type’
`--var-assay-technology-types`
	template variables ‘assay_technology_types’
`--var-biocrates-kit`
	template variables ‘biocrates_kit’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-assay-name`
	template variables ‘assay_name’
`--var-assay-measurement-abbreviation-LC`
	template variables ‘assay_measurement_abbreviation_LC’
`--var-assay-measurement-abbreviation-FIA`
	template variables ‘assay_measurement_abbreviation_FIA’
`--var-biocrates-metidq-version`
	template variables ‘biocrates_metidq_version’
`--var-metaquac-version`
	template variables ‘metaquac_version’
`--var-instrument`
	template variables ‘instrument’
`--var-instruments`
	template variables ‘instruments’
`--var-chromatography-instrument`
	template variables ‘chromatography_instrument’
`--var---output-dir`
	template variables ‘__output_dir’

stem_cell_core_bulk¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl stem_cell_core_bulk [-h] [--var-group VAR_GROUP]
                                    [--var-study-title VAR_STUDY_TITLE]
                                    [--var-sample-numbers VAR_SAMPLE_NUMBERS]
                                    [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                    [--var-source-type VAR_SOURCE_TYPE]
                                    [--var-cellline VAR_CELLLINE]
                                    [--var-cellculture-meta VAR_CELLCULTURE_META]
                                    [--var-model-type VAR_MODEL_TYPE]
                                    [--var-sample-meta VAR_SAMPLE_META]
                                    [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                                    [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                                    [--var-library-kit VAR_LIBRARY_KIT]
                                    [--var-library-construction-meta VAR_LIBRARY_CONSTRUCTION_META]
                                    [--var-sequencing-meta VAR_SEQUENCING_META]
                                    [--var-s-file-name VAR_S_FILE_NAME]
                                    [--var-assay-prefix VAR_ASSAY_PREFIX]
                                    [--var-assay-name VAR_ASSAY_NAME]
                                    [--var---output-dir VAR___OUTPUT_DIR]
                                    output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-group`	template variables ‘group’
`--var-study-title`
	template variables ‘study_title’
`--var-sample-numbers`
	template variables ‘sample_numbers’
`--var-investigation-title`
	template variables ‘investigation_title’
`--var-source-type`
	template variables ‘source_type’
`--var-cellline`	template variables ‘cellline’
`--var-cellculture-meta`
	template variables ‘cellculture_meta’
`--var-model-type`
	template variables ‘model_type’
`--var-sample-meta`
	template variables ‘sample_meta’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-library-kit`
	template variables ‘library_kit’
`--var-library-construction-meta`
	template variables ‘library_construction_meta’
`--var-sequencing-meta`
	template variables ‘sequencing_meta’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-assay-name`
	template variables ‘assay_name’
`--var---output-dir`
	template variables ‘__output_dir’

stem_cell_core_sc¶

When specifying the –var-* argument, you can use JSON syntax. Failing to parse JSON will keep the string value.

cubi-tk isa-tpl stem_cell_core_sc [-h] [--var-group VAR_GROUP]
                                  [--var-study-title VAR_STUDY_TITLE]
                                  [--var-sample-names VAR_SAMPLE_NAMES]
                                  [--var-investigation-title VAR_INVESTIGATION_TITLE]
                                  [--var-source-type VAR_SOURCE_TYPE]
                                  [--var-cellline VAR_CELLLINE]
                                  [--var-cellculture-meta VAR_CELLCULTURE_META]
                                  [--var-model-type VAR_MODEL_TYPE]
                                  [--var-sample-meta VAR_SAMPLE_META]
                                  [--var-a-measurement-type VAR_A_MEASUREMENT_TYPE]
                                  [--var-a-technology-type VAR_A_TECHNOLOGY_TYPE]
                                  [--var-library-kit VAR_LIBRARY_KIT]
                                  [--var-dissociation-meta VAR_DISSOCIATION_META]
                                  [--var-library-construction-meta VAR_LIBRARY_CONSTRUCTION_META]
                                  [--var-sequencing-meta VAR_SEQUENCING_META]
                                  [--var--library-types VAR__LIBRARY_TYPES]
                                  [--var-library-type VAR_LIBRARY_TYPE]
                                  [--var-sample-multiplexing VAR_SAMPLE_MULTIPLEXING]
                                  [--var-genotype-multiplexing VAR_GENOTYPE_MULTIPLEXING]
                                  [--var-s-file-name VAR_S_FILE_NAME]
                                  [--var-assay-prefix VAR_ASSAY_PREFIX]
                                  [--var-assay-name VAR_ASSAY_NAME]
                                  [--var---output-dir VAR___OUTPUT_DIR]
                                  output_dir

Positional Arguments¶

output_dir

Path to output directory

Named Arguments¶

`--var-group`	template variables ‘group’
`--var-study-title`
	template variables ‘study_title’
`--var-sample-names`
	template variables ‘sample_names’
`--var-investigation-title`
	template variables ‘investigation_title’
`--var-source-type`
	template variables ‘source_type’
`--var-cellline`	template variables ‘cellline’
`--var-cellculture-meta`
	template variables ‘cellculture_meta’
`--var-model-type`
	template variables ‘model_type’
`--var-sample-meta`
	template variables ‘sample_meta’
`--var-a-measurement-type`
	template variables ‘a_measurement_type’
`--var-a-technology-type`
	template variables ‘a_technology_type’
`--var-library-kit`
	template variables ‘library_kit’
`--var-dissociation-meta`
	template variables ‘dissociation_meta’
`--var-library-construction-meta`
	template variables ‘library_construction_meta’
`--var-sequencing-meta`
	template variables ‘sequencing_meta’
`--var--library-types`
	template variables ‘_library_types’
`--var-library-type`
	template variables ‘library_type’
`--var-sample-multiplexing`
	template variables ‘sample_multiplexing’
`--var-genotype-multiplexing`
	template variables ‘genotype_multiplexing’
`--var-s-file-name`
	template variables ‘s_file_name’
`--var-assay-prefix`
	template variables ‘assay_prefix’
`--var-assay-name`
	template variables ‘assay_name’
`--var---output-dir`
	template variables ‘__output_dir’

isa-tab¶

ISA-tab tools besides templating.

cubi-tk isa-tab [-h] {add-ped,resolve-hpo,annotate,validate} ...

Positional Arguments¶

isa_tab_cmd

Possible choices: add-ped, resolve-hpo, annotate, validate

Sub-commands¶

add-ped¶

Add records from PED file to ISA-tab

cubi-tk isa-tab add-ped [-h] [--sample-name-normalization {snappy,none}]
                        [--yes] [--dry-run] [--no-show-diff]
                        [--show-diff-side-by-side] [--batch-no BATCH_NO]
                        [--library-type {WES,WGS,Panel_seq}]
                        [--library-layout {SINGLE,PAIRED}]
                        [--library-kit LIBRARY_KIT]
                        [--library-kit-catalogue-id LIBRARY_KIT_CATALOGUE_ID]
                        [--platform PLATFORM]
                        [--instrument-model INSTRUMENT_MODEL]
                        investigation.tsv pedigree.ped

Positional Arguments¶

`investigation.tsv`
	Path to ISA-tab investigation file.
`pedigree.ped`	Path to PLINK PED file with records to add.

Named Arguments¶

`--sample-name-normalization`
	Possible choices: snappy, none Normalize sample names, default: snappy, choices: snappy, none Default: “snappy”
`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--no-show-diff, -D`
	Don’t show change when creating/updating sample sheets. Default: True
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False
`--batch-no`	Value to set as the batch number. Default: “.”
`--library-type`	Possible choices: WES, WGS, Panel_seq The library type. Default: “WES”
`--library-layout`
	Possible choices: SINGLE, PAIRED The library layout. Default: “PAIRED”
`--library-kit`	The library kit used. Default: “”
`--library-kit-catalogue-id`
	The library kit catalogue ID. Default: “”
`--platform`	The string to use for the platform Default: “ILLUMINA”
`--instrument-model`
	The string to use for the instrument model Default: “”

resolve-hpo¶

Resolve HPO term lists to ISA-tab fragments

cubi-tk isa-tab resolve-hpo [-h] [--hpo-obo-url HPO_OBO_URL] [term_file]

Positional Arguments¶

term_file

Path to ISA-tab investigation file.

Default: <_io.TextIOWrapper name=’<stdin>’ mode=’r’ encoding=’UTF-8’>

Named Arguments¶

--hpo-obo-url

Default URL to OBO file.

Default: “http://purl.obolibrary.org/obo/hp.obo”

annotate¶

Add annotation from CSV file to ISA-tab

cubi-tk isa-tab annotate [-h] [--yes] [--dry-run] [--no-show-diff]
                         [--show-diff-side-by-side] [--force-update]
                         [--target-study s_study.tsv]
                         [--target-assay a_assay.tsv]
                         investigation.tsv annotation.tsv

Positional Arguments¶

`investigation.tsv`
	Path to ISA-tab investigation file.
`annotation.tsv`	Path to annotation (TSV) file with information to add.

Named Arguments¶

`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--no-show-diff, -D`
	Don’t show change when creating/updating sample sheets. Default: True
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False
`--force-update`	Overwrite non-empty ISA-tab entries. Default: False
`--target-study, -s`
	File name study to annotate. If not provided, first study in investigation is used.
`--target-assay, -a`
	File name of assay to annotate. If not provided, first assay in investigation is used.

validate¶

Validate ISA-tab

cubi-tk isa-tab validate [-h] [--show-duplicate-warnings] investigation.tsv

Positional Arguments¶

`investigation.tsv`
	Path to ISA-tab investigation file.

Named Arguments¶

--show-duplicate-warnings

Show duplicated warnings, i.e. with same message and same category (False by default)

Default: False

snappy¶

Tools for supporting the SNAPPY pipeline.

cubi-tk snappy [-h]
               {check-local,check-remote,itransfer-raw-data,itransfer-ngs-mapping,itransfer-variant-calling,itransfer-step,pull-sheets,pull-all-data,pull-processed-data,pull-raw-data,varfish-upload,kickoff}
               ...

Positional Arguments¶

snappy_cmd

Possible choices: check-local, check-remote, itransfer-raw-data, itransfer-ngs-mapping, itransfer-variant-calling, itransfer-step, pull-sheets, pull-all-data, pull-processed-data, pull-raw-data, varfish-upload, kickoff

Sub-commands¶

check-local¶

Check consistency within local sample sheet and between local sheets and files

cubi-tk snappy check-local [-h] [--tsv-shortcut {germline,cancer}]
                           [--base-path BASE_PATH]
                           [project_uuids [project_uuids ...]]

Positional Arguments¶

project_uuids

UUID(s) from project(s) to check. Use all if not given.

Named Arguments¶

--tsv-shortcut

Possible choices: germline, cancer

The shortcut TSV schema to use.

Default: “germline”

--base-path

Base path of project (contains ‘ngs_mapping/’ etc.), spiders up from biomedsheet_tsv and falls back to current working directory by default.

Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”

check-remote¶

Check consistency within remote sample sheet and files

cubi-tk snappy check-remote [-h] [--sodar-url SODAR_URL]
                            [--sodar-api-token SODAR_API_TOKEN]
                            [--tsv-shortcut {cancer,generic,germline}]
                            [--base-path BASE_PATH] [--md5]
                            [--assay-uuid ASSAY_UUID]
                            project_uuid

Positional Arguments¶

project_uuid

UUID from Project to check.

Named Arguments¶

`--sodar-url`	URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/ Default: “https://sodar.bihealth.org/”
`--sodar-api-token`
	Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.
`--tsv-shortcut`	Possible choices: cancer, generic, germline The shortcut TSV schema to use. Default: “germline”
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), spiders up from biomedsheet_tsv and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--md5`	Flag to indicate if local and remote MD5 files should be compared. Default: False
`--assay-uuid`	UUID from Assay to check. Used to specify target while dealing with multi-assay projects.

itransfer-raw-data¶

Transfer FASTQs into iRODS landing zone

cubi-tk snappy itransfer-raw-data [-h] [--sodar-url SODAR_URL]
                                  [--sodar-api-token SODAR_API_TOKEN]
                                  [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                                  [--tsv-shortcut {germline,cancer}]
                                  [--first-batch FIRST_BATCH]
                                  [--last-batch LAST_BATCH]
                                  [--base-path BASE_PATH]
                                  [--remote-dir-date REMOTE_DIR_DATE]
                                  [--remote-dir-pattern REMOTE_DIR_PATTERN]
                                  [--yes] [--validate-and-move]
                                  [--assay ASSAY]
                                  destination

Positional Arguments¶

destination

UUID from Landing Zone or Project - where files will be moved to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--tsv-shortcut`	Possible choices: germline, cancer The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern Default: “{library_name}/{step}/{date}”
`--yes`	Assume all answers are yes, e.g., will create or use existing available landing zones without asking. Default: False
`--validate-and-move`
	After files are transferred to SODAR, it will proceed with validation and move. Default: False
`--assay`	UUID of assay to download data for.

itransfer-ngs-mapping¶

Transfer ngs_mapping results into iRODS landing zone

cubi-tk snappy itransfer-ngs-mapping [-h] [--sodar-url SODAR_URL]
                                     [--sodar-api-token SODAR_API_TOKEN]
                                     [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                                     [--tsv-shortcut {germline,cancer}]
                                     [--first-batch FIRST_BATCH]
                                     [--last-batch LAST_BATCH]
                                     [--base-path BASE_PATH]
                                     [--remote-dir-date REMOTE_DIR_DATE]
                                     [--remote-dir-pattern REMOTE_DIR_PATTERN]
                                     [--yes] [--validate-and-move]
                                     [--assay ASSAY] [--mapper MAPPER]
                                     destination

Positional Arguments¶

destination

UUID from Landing Zone or Project - where files will be moved to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--tsv-shortcut`	Possible choices: germline, cancer The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern Default: “{library_name}/{step}/{date}”
`--yes`	Assume all answers are yes, e.g., will create or use existing available landing zones without asking. Default: False
`--validate-and-move`
	After files are transferred to SODAR, it will proceed with validation and move. Default: False
`--assay`	UUID of assay to download data for.
`--mapper`	Name of the mapper to transfer for, defaults to bwa. Default: “bwa”

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

itransfer-variant-calling¶

Transfer variant_calling results into iRODS landing zone

cubi-tk snappy itransfer-variant-calling [-h] [--sodar-url SODAR_URL]
                                         [--sodar-api-token SODAR_API_TOKEN]
                                         [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                                         [--tsv-shortcut {germline,cancer}]
                                         [--first-batch FIRST_BATCH]
                                         [--last-batch LAST_BATCH]
                                         [--base-path BASE_PATH]
                                         [--remote-dir-date REMOTE_DIR_DATE]
                                         [--remote-dir-pattern REMOTE_DIR_PATTERN]
                                         [--yes] [--validate-and-move]
                                         [--assay ASSAY] [--mapper MAPPER]
                                         [--caller CALLER]
                                         destination

Positional Arguments¶

destination

UUID from Landing Zone or Project - where files will be moved to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--tsv-shortcut`	Possible choices: germline, cancer The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern Default: “{library_name}/{step}/{date}”
`--yes`	Assume all answers are yes, e.g., will create or use existing available landing zones without asking. Default: False
`--validate-and-move`
	After files are transferred to SODAR, it will proceed with validation and move. Default: False
`--assay`	UUID of assay to download data for.
`--mapper`	Name of the mapper to transfer for, defaults to bwa. Default: “bwa”
`--caller`	Name of the variant caller to transfer for, defaults to gatk_hc Default: “gatk_hc”

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

itransfer-step¶

Transfer snappy step results into iRODS landing zone

cubi-tk snappy itransfer-step [-h] [--sodar-url SODAR_URL]
                              [--sodar-api-token SODAR_API_TOKEN]
                              [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                              [--tsv-shortcut {germline,cancer}]
                              [--first-batch FIRST_BATCH]
                              [--last-batch LAST_BATCH]
                              [--base-path BASE_PATH]
                              [--remote-dir-date REMOTE_DIR_DATE]
                              [--remote-dir-pattern REMOTE_DIR_PATTERN]
                              [--yes] [--validate-and-move] [--assay ASSAY]
                              [--step STEP] [--tool [TOOL [TOOL ...]]]
                              destination

Positional Arguments¶

destination

UUID from Landing Zone or Project - where files will be moved to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--tsv-shortcut`	Possible choices: germline, cancer The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern Default: “{library_name}/{step}/{date}”
`--yes`	Assume all answers are yes, e.g., will create or use existing available landing zones without asking. Default: False
`--validate-and-move`
	After files are transferred to SODAR, it will proceed with validation and move. Default: False
`--assay`	UUID of assay to download data for.
`--step`	Name of the snappy pipeline step (step name must be identical to step directory).Steps names are available from the snappy command snappy-start-step –help
`--tool`	Name of the tool, for example bwa. Tools order in important:it must match the order used to generate filename prefix.For example, the variant annotation step requires the mapper, caller andthe annotator software. In that case, the snappy file prefix is:<mapper>.<caller>.<annotator>, so the command would be:–tool <mapper> <vcaller> <annotator>. Some steps add more information to theirprefix, for example ‘jannovar_somatic_vcf’

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

pull-sheets¶

Pull SODAR sample sheets into biomedsheet

cubi-tk snappy pull-sheets [-h] [--base-path BASE_PATH] [--yes] [--dry-run]
                           [--no-show-diff] [--show-diff-side-by-side]
                           [--library-types LIBRARY_TYPES]
                           [--first-batch FIRST_BATCH]
                           [--last-batch LAST_BATCH]
                           [--tsv-shortcut {cancer,generic,germline}]

Named Arguments¶

`--base-path`	Base path of project (contains ‘.snappy_pipeline/’ etc.), spiders up from current work directory and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--no-show-diff, -D`
	Don’t show change when creating/updating sample sheets. Default: True
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False
`--library-types`
	Library type(s) to use, comma-separated, default is to use all.
`--first-batch`	First batch to be included in local sample sheet. Defaults: 0. Default: 0
`--last-batch`	Last batch to be included in local sample sheet. Not used by default.
`--tsv-shortcut`	Possible choices: cancer, generic, germline The shortcut TSV schema to use; default: ‘germline’. Default: “germline”

pull-all-data¶

Pull all data from SODAR to specified output directory

cubi-tk snappy pull-all-data [-h] [--base-path BASE_PATH]
                             [--sodar-url SODAR_URL]
                             [--sodar-api-token SODAR_API_TOKEN]
                             --output-directory OUTPUT_DIRECTORY [--overwrite]
                             [--first-batch FIRST_BATCH] [--samples SAMPLES]
                             [--allow-missing] [--yes] [--dry-run]
                             [--irsync-threads IRSYNC_THREADS]
                             [--assay ASSAY_UUID]
                             project_uuid

Positional Arguments¶

project_uuid

UUID of project to download data for.

Named Arguments¶

`--base-path`	Base path of project (contains ‘.snappy_pipeline/’ etc.), spiders up from current work directory and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--output-directory`
	Output directory, where downloaded files will be stored.
`--overwrite`	Allow overwriting of files Default: False
`--first-batch`	First batch number to pull Default: 0
`--samples`	Optional list of samples to pull
`--allow-missing`
	Allow missing data in assay Default: False
`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--irsync-threads`
	Parameter -N to pass to irsync
`--assay`	UUID of assay to create landing zone for.

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

pull-processed-data¶

Pull processed data from SODAR to specified output directory

cubi-tk snappy pull-processed-data [-h] [--sodar-url SODAR_URL]
                                   [--sodar-api-token SODAR_API_TOKEN]
                                   [--tsv-shortcut {cancer,generic,germline}]
                                   [--base-path BASE_PATH]
                                   [--selected-samples SELECTED_SAMPLES]
                                   [--first-batch FIRST_BATCH]
                                   [--last-batch LAST_BATCH]
                                   --output-directory OUTPUT_DIRECTORY
                                   [--sample-id] --file-type
                                   {bam,vcf,txt,csv,log}
                                   [--download-all-versions] [--overwrite]
                                   [--assay-uuid ASSAY_UUID]
                                   project_uuid

Positional Arguments¶

project_uuid

UUID from Project to check.

Named Arguments¶

`--sodar-url`	URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/ Default: “https://sodar.bihealth.org/”
`--sodar-api-token`
	Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.
`--tsv-shortcut`	Possible choices: cancer, generic, germline The shortcut TSV schema to use; default: ‘germline’. Default: “germline”
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), spiders up from biomedsheet_tsv and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--selected-samples`
	Limits the request to the listed sample names. Don’t include the full library name, just the sample name (e.g., ‘P001’ instead of ‘P001-N1-DNA1-WES1’). Separate the sample with comma for multiple samples, example: ‘P001,P002,P003’. Note: argument overrides batch related arguments.
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--output-directory`
	Output directory, where downloaded files will be stored.
`--sample-id`	Flag to indicate if search should be based on sample identifier (e.g.’P001’) instead of library name (e.g. ‘P001-N1-DNA1-WGS1’). Default: False
`--file-type`	Possible choices: bam, vcf, txt, csv, log File extensions to be retrieved. Valid options: (‘bam’, ‘vcf’, ‘txt’, ‘csv’, ‘log’)
`--download-all-versions`
	By default only the latest version of a file will be download. For instance, if a was uploaded two times, in ‘2022-01-31’ and ‘2022-02-28’, only the latest is downloaded. If this flag is present, both versions will be downloaded. Default: False
`--overwrite`	Allow overwriting of local files. Default: False
`--assay-uuid`	UUID from Assay to check. Used to specify target while dealing with multi-assay projects.

pull-raw-data¶

Pull raw data from SODAR to SNAPPY dataset raw data directory

cubi-tk snappy pull-raw-data [-h] [--base-path BASE_PATH]
                             [--sodar-url SODAR_URL]
                             [--sodar-api-token SODAR_API_TOKEN] [--dry-run]
                             [--overwrite]
                             [--tsv-shortcut {cancer,generic,germline}]
                             [--first-batch FIRST_BATCH]
                             [--last-batch LAST_BATCH] [--samples SAMPLES]
                             [--use-library-name] [--assay-uuid ASSAY_UUID]
                             project_uuid

Positional Arguments¶

project_uuid

UUID of project to download data for.

Named Arguments¶

`--base-path`	Base path of project (contains ‘.snappy_pipeline/’ etc.), spiders up from current work directory and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--dry-run, -n`	Perform a dry run, i.e., just displays the files that would be downloaded. Default: False
`--overwrite`	Allow overwriting of files Default: False
`--tsv-shortcut`	Possible choices: cancer, generic, germline The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--samples`	Optional list of samples to pull
`--use-library-name`
	Flag to indicate that the search in SODAR directories should be based on library name (e.g. ‘P001-N1-DNA1-WGS1’) instead of sample identifier (e.g.’P001’) in the file name. Default: False
`--assay-uuid`	UUID of assay to create landing zone for.

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

varfish-upload¶

Upload variant analysis results into VarFish

cubi-tk snappy varfish-upload [-h] [--varfish-config VARFISH_CONFIG]
                              [--varfish-server-url VARFISH_SERVER_URL]
                              [--varfish-api-token VARFISH_API_TOKEN]
                              [--base-path BASE_PATH] [--steps STEPS]
                              [--external-data] [--min-batch MIN_BATCH]
                              [--yes] [--samples SAMPLES]
                              project [project ...]

Positional Arguments¶

project

The UUID(s) of the SODAR project to submit.

Named Arguments¶

`--base-path`	Base path of project (contains ‘.snappy_pipeline/’ etc.), spiders up from current work directory and falls back to current working directory by default. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--steps`	Pipeline steps to consider for the export. Defaults to include all of the following; specify this with +name/-name to add/remove and either give multiple arguments or use a comma-separated list. {ngs_mapping, targeted_seq_cnv_export, varfish_export, variant_export, variant_export_external, wgs_cnv_export, wgs_cnv_export_external, wgs_sv_export, wgs_sv_export_external} Default: []
`--external-data`
	Flag to indicate that data was externally generated. Search for files will not filter based on common internally tool combinations, example: ‘bwa.delly2’ or ‘bwa.gatk_hc’. Default: False
`--min-batch`	Smallest batch to transfer, keep empty to transfer all.
`--yes, -y`	Assume yes to all answers Default: False
`--samples`	Limits the submission to the listed sample names. Don’t include the full library name just the sample name (e.g., ‘P001’ instead of ‘P001-N1-DNA1-WES1’). Separate the sample with comma for multiple samples, example: ‘P001,P002,P003’. Default: “”

VarFish Configuration¶

`--varfish-config`
	Path to configuration file.
`--varfish-server-url`
	SODAR server URL key to use, defaults to env VARFISH_SERVER_URL.
`--varfish-api-token`
	SODAR API token to use, defaults to env VARFISH_API_TOKEN.

kickoff¶

Kick-off SNAPPY pipeline steps.

cubi-tk snappy kickoff [-h] [--dry-run] [--timeout TIMEOUT] [path]

Positional Arguments¶

path

Path into SNAPPY directory (below a directory containing .snappy_pipeline).

Named Arguments¶

--dry-run, -n

Perform dry-run, do not do anything.

Default: False

--timeout

Number of seconds to wait for commands.

Default: 10

sodar¶

SODAR command line interface.

cubi-tk sodar [-h]
              {add-ped,download-sheet,upload-sheet,pull-raw-data,landing-zone-create,landing-zone-list,landing-zone-move,ingest-fastq,check-remote}
              ...

Positional Arguments¶

sodar_cmd

Possible choices: add-ped, download-sheet, upload-sheet, pull-raw-data, landing-zone-create, landing-zone-list, landing-zone-move, ingest-fastq, check-remote

Sub-commands¶

add-ped¶

Augment sample sheet from PED file

cubi-tk sodar add-ped [-h] [--sodar-url SODAR_URL]
                      [--sodar-api-token SODAR_API_TOKEN] [--dry-run]
                      [--show-diff] [--show-diff-side-by-side]
                      [--sample-name-normalization {snappy,none}] [--yes]
                      [--batch-no BATCH_NO]
                      [--library-type {WES,WGS,Panel_seq}]
                      [--library-layout {SINGLE,PAIRED}]
                      [--library-kit LIBRARY_KIT]
                      [--library-kit-catalogue-id LIBRARY_KIT_CATALOGUE_ID]
                      [--platform PLATFORM]
                      [--instrument-model INSTRUMENT_MODEL]
                      project_uuid pedigree.ped

Positional Arguments¶

`project_uuid`	UUID of project to download the ISA-tab for.
`pedigree.ped`	Path to PLINK PED file with records to add.

Named Arguments¶

`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--show-diff, -D`
	Show change when creating/updating sample sheets. Default: False
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False
`--sample-name-normalization`
	Possible choices: snappy, none Normalize sample names, default: snappy, choices: snappy, none Default: “snappy”
`--yes`	Assume all answers are yes. Default: False
`--batch-no`	Value to set as the batch number. Default: “.”
`--library-type`	Possible choices: WES, WGS, Panel_seq The library type. Default: “WES”
`--library-layout`
	Possible choices: SINGLE, PAIRED The library layout. Default: “PAIRED”
`--library-kit`	The library kit used. Default: “”
`--library-kit-catalogue-id`
	The library kit catalogue ID. Default: “”
`--platform`	The string to use for the platform Default: “ILLUMINA”
`--instrument-model`
	The string to use for the instrument model Default: “”

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

download-sheet¶

Download ISA-tab

cubi-tk sodar download-sheet [-h] [--sodar-url SODAR_URL]
                             [--sodar-api-token SODAR_API_TOKEN]
                             [--no-makedirs] [--overwrite] [--yes] [--dry-run]
                             [--show-diff] [--show-diff-side-by-side]
                             project_uuid output_dir

Positional Arguments¶

`project_uuid`	UUID of project to download the ISA-tab for.
`output_dir`	Path to output directory to write the sheet to.

Named Arguments¶

`--no-makedirs`	Create output directories Default: True
`--overwrite`	Allow overwriting of files Default: False
`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--show-diff, -D`
	Show change when creating/updating sample sheets. Default: False
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

upload-sheet¶

Upload and replace ISA-tab

cubi-tk sodar upload-sheet [-h] [--sodar-url SODAR_URL]
                           [--sodar-api-token SODAR_API_TOKEN]
                           project_uuid input_investigation_file

Positional Arguments¶

`project_uuid`	UUID of project to upload the ISA-tab for.
`input_investigation_file`
	Path to input investigation file.

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

pull-raw-data¶

Download raw data from iRODS

cubi-tk sodar pull-raw-data [-h] [--sodar-url SODAR_URL]
                            [--sodar-api-token SODAR_API_TOKEN] [--overwrite]
                            [--min-batch MIN_BATCH] [--allow-missing] [--yes]
                            [--dry-run] [--irsync-threads IRSYNC_THREADS]
                            [--assay ASSAY]
                            project_uuid output_dir

Positional Arguments¶

`project_uuid`	UUID of project to download data for.
`output_dir`	Path to output directory to write the raw data to.

Named Arguments¶

`--overwrite`	Allow overwriting of files Default: False
`--min-batch`	Minimal batch number to pull Default: 0
`--allow-missing`
	Allow missing data in assay Default: False
`--yes`	Assume all answers are yes. Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--irsync-threads`
	Parameter -N to pass to irsync
`--assay`	UUID of assay to download data for.

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

landing-zone-create¶

Creating landing zone

cubi-tk sodar landing-zone-create [-h] [--sodar-url SODAR_URL]
                                  [--sodar-api-token SODAR_API_TOKEN]
                                  [--unless-exists] [--dry-run]
                                  [--assay ASSAY] [--format FORMAT_STRING]
                                  project_uuid

Positional Arguments¶

project_uuid

UUID of project to create the landing zone in.

Named Arguments¶

`--unless-exists`
	If there already is a landing zone in the current project then use this one Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--assay`	UUID of assay to create landing zone for.
`--format`	Format string for printing, e.g. %(uuid)s

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

landing-zone-list¶

List landing zones

cubi-tk sodar landing-zone-list [-h] [--sodar-url SODAR_URL]
                                [--sodar-api-token SODAR_API_TOKEN]
                                [--unless-exists] [--dry-run]
                                [--format FORMAT_STRING]
                                project_uuid

Positional Arguments¶

project_uuid

UUID of project to create the landing zone in.

Named Arguments¶

`--unless-exists`
	If there already is a landing zone in the current project then use this one Default: False
`--dry-run, -n`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--format`	Format string for printing, e.g. %(uuid)s

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

landing-zone-move¶

Submit landing zone for moving

cubi-tk sodar landing-zone-move [-h] [--sodar-url SODAR_URL]
                                [--sodar-api-token SODAR_API_TOKEN]
                                [--dry-run] [--format FORMAT_STRING]
                                landing_zone_uuid

Positional Arguments¶

`landing_zone_uuid`
	UUID of landing zone to move.

Named Arguments¶

--dry-run, -n

Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’.

Default: False

--format

Format string for printing, e.g. %(uuid)s

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

ingest-fastq¶

Upload external files to SODAR (defaults for fastq)

cubi-tk sodar ingest-fastq [-h] [--sodar-url SODAR_URL]
                           [--sodar-api-token SODAR_API_TOKEN]
                           [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                           [--yes] [--base-path BASE_PATH]
                           [--remote-dir-date REMOTE_DIR_DATE]
                           [--src-regex SRC_REGEX]
                           [--remote-dir-pattern REMOTE_DIR_PATTERN]
                           [--add-suffix ADD_SUFFIX] [-m MATCH REPL]
                           [--tmp TMP]
                           sources [sources ...] destination

Positional Arguments¶

`sources`	paths to fastq folders
`destination`	UUID or iRods path of landing zone to move to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--yes`	Assume the answer to all prompts is ‘yes’ Default: False
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--src-regex`	Regular expression to use for matching input fastq files, default: (./)?(?P<sample>.+?)(?:_(?P<lane>L[0-9]+?))?(?:_(?P<mate>R[0-9]+?))?(?:_(?P<batch>[0-9]+?))?.f(?:ast)?q.gz Default: “(./)?(?P<sample>.+?)(?:_(?P<lane>L[0-9]+?))?(?:_(?P<mate>R[0-9]+?))?(?:_(?P<batch>[0-9]+?))?.f(?:ast)?q.gz”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern, default: {sample}/{date}/{filename} Default: “{sample}/{date}/{filename}”
`--add-suffix`	Suffix to add to all file names (e.g. ‘-N1-DNA1-WES1’). Default: “”
`-m, --remote-dir-mapping`
	Substitutions applied to the filled remote dir paths. Can for example be used to modify sample names. Use pythons regex syntax of ‘re.sub’ package. This argument can be used multiple times (i.e. ‘-m <regex1> <repl1> -m <regex2> <repl2>’ …). Default: []
`--tmp`	Folder to save files from WebDAV temporarily, if set as source. Default: “temp/”

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

check-remote¶

Compare local files with md5 sum against SODAR/iRODS

cubi-tk sodar check-remote [-h] [--sodar-url SODAR_URL]
                           [--sodar-api-token SODAR_API_TOKEN] [-p BASE_PATH]
                           [--filename-only] [--recheck-md5]
                           [--report-md5sums] [--assay-uuid ASSAY_UUID]
                           project_uuid

Positional Arguments¶

project_uuid

UUID from Project to check.

Named Arguments¶

`--sodar-url`	URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/ Default: “https://sodar.bihealth.org/”
`--sodar-api-token`
	Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.
`-p, --base-path`
	Base path in which local files with md5 sums should be identified. Default: CWD Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--filename-only`
	Flag to indicate whether file comparison between local and remote files should only use file names and ignore md5 values. Default: False
`--recheck-md5`	Flag to double check that md5 sums stored in local files do actually match their corresponding files Default: False
`--report-md5sums`
	Flag to indicate if md5 sums should be included in file report Default: False
`--assay-uuid`	UUID from Assay to check. Used to specify target while dealing with multi-assay projects.

irods¶

iRods command line interface.

cubi-tk irods [-h] {check} ...

Positional Arguments¶

irods_cmd

Possible choices: check

Sub-commands¶

check¶

Check target iRODS collection (all MD5 files? metadata MD5 consistent? enough replicas?).

cubi-tk irods check [-h] [-r REQ_NUM_REPS] [-p NUM_PARALLEL_TESTS]
                    [-d NUM_DISPLAY_FILES] [-s HASH_SCHEME]
                    irods_path

Positional Arguments¶

irods_path

Path to an iRODS collection.

Named Arguments¶

`-r, --num-replicas`
	Minimum number of replicas, defaults to 2 Default: 2
`-p, --num-parallel-tests`
	Number of parallel tests, defaults to 4 Default: 4
`-d, --num-display-files`
	Number of files listed when checking, defaults to 20 Default: 20
`-s, --hash-scheme`
	Hash scheme used to verify checksums, defaults to MD5 Default: “MD5”

org-raw¶

org_raw command line interface.

cubi-tk org-raw [-h] {check,organize} ...

Positional Arguments¶

org_raw_cmd

Possible choices: check, organize

Sub-commands¶

check¶

Check consistency of raw data

cubi-tk org-raw check [-h] [--num-threads NUM_THREADS] [--no-gz-check]
                      [--no-md5-check] [--no-compute-md5]
                      [--missing-md5-error] [--create-md5-fail-no-error]
                      FILE.fastq.gz [FILE.fastq.gz ...]

Positional Arguments¶

FILE.fastq.gz

Path(s) to .fastq.gz files to perform the check for

Named Arguments¶

`--num-threads`	Number of parallel threads Default: 0
`--no-gz-check`	Deactivate check for gzip consistency (default is to perform check). Default: True
`--no-md5-check`	Deactivate comparison of MD5 sum if .md5 file exists (default is to perform check). Default: True
`--no-compute-md5`
	Deactivate computation of MD5 sum if missing (default is to compute MD5 sum). Default: True
`--missing-md5-error`
	Make missing .md5 files constitute an error. Default is to issue an log message only. Default: False
`--create-md5-fail-no-error`
	Make failure to create .md5 file not an error. Default is to make it an error. Default: True

organize¶

Check consistency of raw data

cubi-tk org-raw organize [-h] [--dry-run] [--yes] [--move] [--no-check]
                         [--src-regex SRC_REGEX] [--dest-pattern DEST_PATTERN]
                         [--num-threads NUM_THREADS] [--no-gz-check]
                         [--no-md5-check] [--no-compute-md5]
                         [--missing-md5-error] [--create-md5-fail-no-error]
                         out_path path.fastq.gz [path.fastq.gz ...]

Positional Arguments¶

`out_path`	Path to output directory.
`path.fastq.gz`	Path to input files.

Named Arguments¶

`--dry-run`	Dry-run, do not actually do anything Default: False
`--yes`	Assume the answer to all prompts is ‘yes’ Default: False
`--move`	Move file(s) instead of copying, default is to copy. Default: False
`--no-check`	Do not run ‘raw-org check’ on output (default is to run). Default: True
`--src-regex`	Regular expression for parsing file paths. Default: (./)?(?P<sample>.+)(?:-.+?)?.f(?:ast)?q.gz Default: “(./)?(?P<sample>.+)(?:-.+?)?.f(?:ast)?q.gz”
`--dest-pattern`	Format expression for destination path generation. Default: {sample_name}/{file_name} Default: “{sample_name}/{file_name}”
`--num-threads`	Number of parallel threads Default: 0
`--no-gz-check`	Deactivate check for gzip consistency (default is to perform check). Default: True
`--no-md5-check`	Deactivate comparison of MD5 sum if .md5 file exists (default is to perform check). Default: True
`--no-compute-md5`
	Deactivate computation of MD5 sum if missing (default is to compute MD5 sum). Default: True
`--missing-md5-error`
	Make missing .md5 files constitute an error. Default is to issue an log message only. Default: False
`--create-md5-fail-no-error`
	Make failure to create .md5 file not an error. Default is to make it an error. Default: True

sea-snap¶

Tools for supporting the RNA-SeASnaP pipeline.

cubi-tk sea-snap [-h]
                 {itransfer-raw-data,itransfer-results,working-dir,write-sample-info,check-irods}
                 ...

Positional Arguments¶

sea_snap_cmd

Possible choices: itransfer-raw-data, itransfer-results, working-dir, write-sample-info, check-irods

Sub-commands¶

itransfer-raw-data¶

Transfer FASTQs into iRODS landing zone

cubi-tk sea-snap itransfer-raw-data [-h] [--sodar-url SODAR_URL]
                                    [--sodar-api-token SODAR_API_TOKEN]
                                    [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                                    [--tsv-shortcut {germline,cancer}]
                                    [--first-batch FIRST_BATCH]
                                    [--last-batch LAST_BATCH]
                                    [--base-path BASE_PATH]
                                    [--remote-dir-date REMOTE_DIR_DATE]
                                    [--remote-dir-pattern REMOTE_DIR_PATTERN]
                                    [--yes] [--validate-and-move]
                                    [--assay ASSAY]
                                    destination

Positional Arguments¶

destination

UUID from Landing Zone or Project - where files will be moved to.

Named Arguments¶

`--num-parallel-transfers`
	Number of parallel transfers, defaults to 8 Default: 8
`--tsv-shortcut`	Possible choices: germline, cancer The shortcut TSV schema to use. Default: “germline”
`--first-batch`	First batch to be transferred. Defaults: 0. Default: 0
`--last-batch`	Last batch to be transferred.
`--base-path`	Base path of project (contains ‘ngs_mapping/’ etc.), defaults to current path. Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”
`--remote-dir-date`
	Date to use in remote directory, defaults to YYYY-MM-DD of today. Default: “2023-07-04”
`--remote-dir-pattern`
	Pattern to use for constructing remote pattern Default: “{library_name}/{step}/{date}”
`--yes`	Assume all answers are yes, e.g., will create or use existing available landing zones without asking. Default: False
`--validate-and-move`
	After files are transferred to SODAR, it will proceed with validation and move. Default: False
`--assay`	UUID of assay to download data for.

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

itransfer-results¶

Transfer mapping results into iRODS landing zone

cubi-tk sea-snap itransfer-results [-h] [--sodar-url SODAR_URL]
                                   [--sodar-api-token SODAR_API_TOKEN]
                                   [--num-parallel-transfers NUM_PARALLEL_TRANSFERS]
                                   transfer_blueprint destination

Positional Arguments¶

`transfer_blueprint`
	Path to blueprint file to load. This file contains commands to sync files with iRODS. Blocks of commands separated by an empty line will be executed together in one thread.
`destination`	UUID or iRods path of landing zone to move to.

Named Arguments¶

--num-parallel-transfers

Number of parallel transfers, defaults to 8

Default: 8

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-api-token

Authentication token when talking to SODAR. Defaults to SODAR_API_TOKEN environment variable.

working-dir¶

Create working directory

cubi-tk sea-snap working-dir [-h] [--dry-run] [--dirname DIRNAME]
                             [--configs {mapping,DE} [{mapping,DE} ...]]
                             [sea_snap_path]

Positional Arguments¶

sea_snap_path

Path into RNA-SeA-SnaP directory (below a directory containing ‘mapping_pipeline.snake’).

Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/docs_manual”

Named Arguments¶

--dry-run, -n

Perform dry-run, do not do anything.

Default: False

--dirname, -d

Name of the working directory to create (default: ‘results_YEAR_MONTH_DAY/’).

Default: “results_%Y_%m_%d/”

--configs, -c

Possible choices: mapping, DE

Configs to be imported (default: all).

Default: [‘mapping’, ‘DE’]

write-sample-info¶

Generate sample info file

cubi-tk sea-snap write-sample-info [-h] [--allow-overwrite] [--dry-run]
                                   [--show-diff] [--show-diff-side-by-side]
                                   [--from-file FROM_FILE]
                                   [--isa-assay ISA_ASSAY]
                                   [--project_uuid PROJECT_UUID]
                                   [--output_folder OUTPUT_FOLDER]
                                   [--overwrite-isa] [--sodar-url SODAR_URL]
                                   [--sodar-auth-token SODAR_AUTH_TOKEN]
                                   in_path_pattern [output_file]

Positional Arguments¶

in_path_pattern

Path pattern to use for extracting input file information. See https://cubi-gitlab.bihealth.org/CUBI/Pipelines/sea-snap/blob/master/documentation/prepare_input.md#fastq-files-folder-structure.

output_file

Filename ending with ‘.yaml’ or ‘.tsv’. default: sample_info.yaml.

Default: sample_info.yaml

Named Arguments¶

`--allow-overwrite`
	Allow to overwrite output file, default is not to allow overwriting output file. Default: False
`--dry-run`	Perform a dry run, i.e., don’t change anything only display change, implies ‘–show-diff’. Default: False
`--show-diff`	Show change when creating/updating sample sheets. Default: False
`--show-diff-side-by-side`
	Show diff side by side instead of unified. Default: False
`--from-file`	Path to yaml file to convert to tsv or tsv to yaml. Not used, if not specified.
`--isa-assay`	Path to ISA assay file. Not used, if not specified.

pull ISA files¶

`--project_uuid`	If set pull ISA files from SODAR. UUID of project to pull from. Default: False
`--output_folder`
	Output folder path to store ISA files. Default: “ISA_files/”
`--overwrite-isa`
	Allow to overwrite output file, default is not to allow overwriting output file. Default: False

SODAR-related¶

--sodar-url

URL to SODAR, defaults to SODAR_URL environment variable or fallback to https://sodar.bihealth.org/

Default: “https://sodar.bihealth.org/”

--sodar-auth-token

Authentication token when talking to SODAR. Defaults to SODAR_AUTH_TOKEN environment variable.

check-irods¶

Check consistency of sample info, blueprint and files on SODAR

cubi-tk sea-snap check-irods [-h] [--num-replicas NUM_REPLICAS]
                             [--num-parallel-tests NUM_PARALLEL_TESTS] [--yes]
                             [--transfer-blueprint TRANSFER_BLUEPRINT]
                             results_folder irods_path

Positional Arguments¶

`results_folder`	Path to a Sea-snap results folder.
`irods_path`	Path to an iRods collection.

Named Arguments¶

`--num-replicas`	Minimum number of replicas, defaults to 2 Default: 2
`--num-parallel-tests`
	Number of parallel tests, defaults to 8 Default: 8
`--yes`	Assume the answer to all prompts is ‘yes’ Default: False
`--transfer-blueprint`
	Filename of blueprint file for export to SODAR (created e.g. with ‘./sea-snap sc l export’). Assumed to be in the results folder. Default: ‘SODAR_export_blueprint.txt’ Default: “SODAR_export_blueprint.txt”

archive¶

helper for archiving projects.

cubi-tk archive [-h] {copy,prepare,readme,summary} ...

Positional Arguments¶

archive_cmd

Possible choices: copy, prepare, readme, summary

Sub-commands¶

copy¶

Perform archival (copy and audit)

cubi-tk archive copy [-h] [--num-threads NUM_THREADS]
                     [--skip [SKIP [SKIP ...]]] [--keep-workdir-hashdeep]
                     [--read-only]
                     project destination

Positional Arguments¶

`project`	Path of project directory
`destination`	Final destination directory for archive, must not exist

Named Arguments¶

`--num-threads`	Number of parallel threads Default: 4
`--skip`	Step to skip (hashdeep, rsync, audit)
`--keep-workdir-hashdeep`
	Save hashdeep report & audit of the temporary destination Default: False
`--read-only`	Change destination files to read-only Default: False

prepare¶

Prepare the project directory for archival

cubi-tk archive prepare [-h] [--num-threads NUM_THREADS] [--rules RULES]
                        [--readme README] [--ignore-tar-errors]
                        project destination

Positional Arguments¶

`project`	Path of project directory
`destination`	Destination directory (for symlinks and later archival)

Named Arguments¶

`--num-threads`	Number of parallel threads Default: 4
`--rules, -r`	Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/cubi_tk/archive/default_rules.yaml”
`--readme`	Path to README.md created with cubi-tk
`--ignore-tar-errors`
	Ignore errors due to access permissions in when compressind folders Default: False

readme¶

Prepare a valid README.md

cubi-tk archive readme [-h] [--skip-collect] [--is-valid]
                       [--var-directory VAR_DIRECTORY]
                       [--var-PI-name VAR_PI_NAME]
                       [--var-PI-email VAR_PI_EMAIL]
                       [--var-archiver-name VAR_ARCHIVER_NAME]
                       [--var-archiver-email VAR_ARCHIVER_EMAIL]
                       [--var-CUBI-name VAR_CUBI_NAME]
                       [--var-client-name VAR_CLIENT_NAME]
                       [--var-client-email VAR_CLIENT_EMAIL]
                       [--var-SODAR-UUID VAR_SODAR_UUID]
                       [--var-SODAR-URL VAR_SODAR_URL]
                       [--var-Gitlab-URL VAR_GITLAB_URL]
                       [--var-project-name VAR_PROJECT_NAME]
                       [--var-start-date VAR_START_DATE]
                       [--var-current-status VAR_CURRENT_STATUS]
                       [--var-size VAR_SIZE] [--var-inodes VAR_INODES]
                       [--var-size-follow VAR_SIZE_FOLLOW]
                       [--var-inodes-follow VAR_INODES_FOLLOW]
                       [--var-snakemake-nb VAR_SNAKEMAKE_NB]
                       project filename

Positional Arguments¶

`project`	Path of project directory
`filename`	README.md path & filename

Named Arguments¶

`--skip-collect, -s`
	Skip the collection of file size & inodes Default: False
`--is-valid, -t`	Test validity of existing README file Default: False
`--var-directory`
	template variable ‘directory’
`--var-PI-name`	template variable ‘PI_name’
`--var-PI-email`	template variable ‘PI_email’
`--var-archiver-name`
	template variable ‘archiver_name’
`--var-archiver-email`
	template variable ‘archiver_email’
`--var-CUBI-name`
	template variable ‘CUBI_name’
`--var-client-name`
	template variable ‘client_name’
`--var-client-email`
	template variable ‘client_email’
`--var-SODAR-UUID`
	template variable ‘SODAR_UUID’
`--var-SODAR-URL`
	template variable ‘SODAR_URL’
`--var-Gitlab-URL`
	template variable ‘Gitlab_URL’
`--var-project-name`
	template variable ‘project_name’
`--var-start-date`
	template variable ‘start_date’
`--var-current-status`
	template variable ‘current_status’
`--var-size`	template variable ‘size’
`--var-inodes`	template variable ‘inodes’
`--var-size-follow`
	template variable ‘size_follow’
`--var-inodes-follow`
	template variable ‘inodes_follow’
`--var-snakemake-nb`
	template variable ‘snakemake_nb’

summary¶

Collects a summary of files in the project directory. The summary can be saved to a file for further inspection

cubi-tk archive summary [-h] [--classes CLASSES] [--dont-follow-links]
                        project table

Positional Arguments¶

`project`	Path of project directory
`table`	Location of the summary output table

Named Arguments¶

--classes

Location of the file describing files of interest

Default: “/home/docs/checkouts/readthedocs.org/user_builds/cubi-tk/checkouts/latest/cubi_tk/archive/classes.yaml”

--dont-follow-links

Do not follow symlinks to directories. Required when the project contains circular symlinks

Default: False