Skip to content
This repository has been archived by the owner on Feb 16, 2022. It is now read-only.

ACWalker/CARE-2021

Repository files navigation

CARE-2021

  • pilots folder contain raw exported data from toloka for all our pilots

Saliency Maps

  • saliency_maps.ipynb contains code for generating saliency maps
  • stage-3-fine-tuned-res50.pkl is the pretrained model. It is loaded in saliency_maps.ipynb notebook, use it for inference tasks or CAM generation

Quality Control

  • qc.py includes functions for running majority voting (fixed_annotations, free_text fubcs) and crowdtruth metrics (helper func) on Toloka labeling output. Due to high variability in json outputs (free text vs fine grained vs coarse grained) different annotation extraction parts are commented out within these functions.

Phase 2

  • phase2_results folder contains post-processed outcomes of phase 2 using CrowdTruth
  • phase2_analysis.ipynb contains graph generation based upon the post-processed data

Phase 3

  • phase3_results folder contains phase 3 checkbox outcomes (exported directly from toloka): 2 files for two completed pools
  • phase3_aggregation.ipynb contains aggregation based on final results

Library dependencies

  • pytorch
  • fastai
  • slugify
  • stringcase
  • matplotlib
  • scikit-image
  • tqdm
  • pandas
  • deep_translator
  • CrowdTruth

Our project makes use of CrowdTruth framework

@article{CrowdTruth2,
  author    = {Anca Dumitrache and Oana Inel and Lora Aroyo and Benjamin Timmermans and Chris Welty},
  title     = {CrowdTruth 2.0: Quality Metrics for Crowdsourcing with Disagreement},
  year      = {2018},
  url       = {https://arxiv.org/abs/1808.06080},
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published