snapatac2.pp.import_contacts#
- snapatac2.pp.import_contacts(contact_file, chrom_sizes, *, file=None, sorted_by_barcode=True, bin_size=500000, chunk_size=200, tempdir=None, backend='hdf5')[source]#
Import chromatin contacts.
- Parameters:
contact_file (
Path
) – File name of the fragment file.file (
Path
|None
) – File name of the output h5ad file used to store the result. If provided, result will be saved to a backed AnnData, otherwise an in-memory AnnData is used.chrom_sizes (
Genome
|dict
[str
,int
]) – A Genome object or a dictionary containing chromosome sizes, for example,{"chr1": 2393, "chr2": 2344, ...}
.sorted_by_barcode (
bool
) – Whether the fragment file has been sorted by cell barcodes. Ifsorted_by_barcode == True
, this function makes use of small fixed amout of memory. Ifsorted_by_barcode == False
andlow_memory == False
, all data will be kept in memory. Seelow_memory
for more details.bin_size (
int
) – The size of consecutive genomic regions used to record the counts.chunk_size (
int
) – Increasing the chunk_size speeds up I/O but uses more memory.tempdir (
Path
|None
) – Location to store temporary files. IfNone
, system temporary directory will be used.backend (
Literal
['hdf5'
]) – The backend.
- Returns:
An annotated data matrix of shape
n_obs
xn_vars
. Rows correspond to cells and columns to regions. Iffile=None
, an in-memory AnnData will be returned, otherwise a backed AnnData is returned.- Return type:
AnnData | ad.AnnData