- San Francisco, CA
- http://www.linkedin.com/in/gerashegalov/
- @gerashegalov
-
spark-rapids Public
Forked from NVIDIA/spark-rapidsSpark RAPIDS plugin - accelerate Apache Spark with GPUs
Scala Apache License 2.0 UpdatedJan 23, 2025 -
spark-rapids-jni Public
Forked from NVIDIA/spark-rapids-jniRAPIDS Accelerator JNI For Apache Spark
Cuda Apache License 2.0 UpdatedDec 11, 2024 -
rapids-shell Public
Utility to run/debug Spark RAPIDS in REPL
-
spark-rapids-examples Public
Forked from NVIDIA/spark-rapids-examplesA repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
Python Apache License 2.0 UpdatedDec 6, 2024 -
-
cudf Public
Forked from rapidsai/cudfcuDF - GPU DataFrame Library
C++ Apache License 2.0 UpdatedOct 2, 2024 -
spark-rapids-benchmarks Public
Forked from NVIDIA/spark-rapids-benchmarksSpark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Python Apache License 2.0 UpdatedJan 23, 2024 -
spark Public
Forked from apache/sparkMirror of Apache Spark
Scala Apache License 2.0 UpdatedAug 24, 2023 -
t-digest Public
Forked from tdunning/t-digestA new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Java Apache License 2.0 UpdatedJul 10, 2023 -
xgboost Public
Forked from dmlc/xgboostScalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
C++ Apache License 2.0 UpdatedSep 8, 2022 -
rmm Public
Forked from rapidsai/rmmRAPIDS Memory Manager
C++ Apache License 2.0 UpdatedJun 6, 2022 -
-
hadoop Public
Forked from apache/hadoopMirror of Apache Hadoop
Java Apache License 2.0 UpdatedJan 12, 2022 -
takari-local-repository Public
Forked from takari/takari-local-repositoryJava Eclipse Public License 1.0 UpdatedOct 13, 2020 -
TransmogrifAI Public
Forked from salesforce/TransmogrifAITransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Spark with minimal hand tuning
Scala BSD 3-Clause "New" or "Revised" License UpdatedSep 12, 2020 -
schema-registry Public
Forked from confluentinc/schema-registryConfluent Schema Registry for Kafka
Java Other UpdatedOct 1, 2019 -
transmogrifai-helloworld-sbt Public
Forked from salesforce/transmogrifai-helloworld-sbtScala BSD 3-Clause "New" or "Revised" License UpdatedSep 18, 2019 -
scalaj-http Public
Forked from scalaj/scalaj-httpSimple scala wrapper for HttpURLConnection. OAuth included.
Scala Apache License 2.0 UpdatedSep 18, 2018 -
hdfs-mount Public
Forked from microsoft/hdfs-mountA tool to mount HDFS as a local Linux file system
Go Other UpdatedJun 29, 2018 -
aardpfark Public
Forked from CODAIT/aardpfarkA library for exporting Spark ML models and pipelines to PFA
Scala Apache License 2.0 UpdatedJun 8, 2018 -
azkaban Public
Forked from azkaban/azkabanAzkaban workflow manager.
Java Apache License 2.0 UpdatedApr 7, 2018 -
parquet-mr Public
Forked from apache/parquet-javaMirror of Apache Parquet
Java Apache License 2.0 UpdatedMay 25, 2017 -
scalding Public
Forked from twitter/scaldingA Scala API for Cascading
Scala Apache License 2.0 UpdatedApr 28, 2016 -
presto Public
Forked from prestodb/prestoDistributed SQL query engine for running interactive analytic queries against big data sources.
Java Apache License 2.0 UpdatedDec 29, 2015 -
elephant-bird Public
Forked from twitter/elephant-birdTwitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Java Apache License 2.0 UpdatedNov 20, 2015 -
Impala Public
Forked from KarthikTunga/impalaReal-time Query for Hadoop
C++ Apache License 2.0 UpdatedAug 6, 2015 -
testsplits Public
Standalone tool to benchmark LzoInputFormat getSplits performance
Java UpdatedApr 6, 2015 -
-
cascading Public
Forked from cwensel/cascadingCascading is a feature rich API for defining and executing complex and fault tolerant data processing flows on a Hadoop cluster. See https://github.com/Cascading/cascading for the release repository.
Java Other UpdatedMar 13, 2015 -
Impatient Public
Forked from Cascading/Impatientsource examples to support the "Cascading for the Impatient" blog post series
Java UpdatedAug 7, 2014