-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Datatype Support in Quality Control and Impute (#865)
* Enhancement: Add Dask support for explicit imputation * Enhancement: Add Dask support for quality control metrics and imputation tests * Fix test for imputation to handle Dask arrays without raising errors * Refactor quality control metrics functions to streamline computation and improve readability * added expected error * Remove unused Dask import from quality control module * simplify missing value computation * Rename parameter 'arr' to 'mtx' in _compute_obs_metrics no longer creates copy * daskify qc_metrics * Add fixture for array types and update imputation tests for dask arrays * Refactor _compute_var_metrics to prevent modification of the original data matrix and add a test for encoding mode integrity * Add parameterized tests for array types in miceforest imputation * Update missing values handling to include array type in error message and refine parameterized tests for miceforest imputation * Fix array type handling in missing values computation and update test for miceforest imputation * Implement array type handling in load_dataframe function and update tests for miceforest imputation * Remove parameterization for array types in miceforest numerical data imputation test * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <[email protected]> * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <[email protected]> * revert deepcopy changes * Fix test to ensure original matrix is not modified after encoding * Remove unused parameters from observation and variable metrics computation functions * Add sparse.csr_matrix to explicit impute array types test case * Parameterize quality control metrics tests to support multiple array types * Remove unused imports from test_quality_control.py * encode blocks dask function * Add pytest fixtures for observation and variable data in tests * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <[email protected]> * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <[email protected]> * support dask explicit impute all object types --------- Co-authored-by: eroell <[email protected]> Co-authored-by: Lukas Heumos <[email protected]> Co-authored-by: Eljas Roellin <[email protected]>
- Loading branch information
1 parent
324a978
commit 4c4db5f
Showing
5 changed files
with
277 additions
and
97 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.