Skip to content

Commit b582d36

Browse files
committed
Adding Middleware'24 configs, build options to install.sh
1 parent 5da2272 commit b582d36

File tree

15 files changed

+171
-18
lines changed

15 files changed

+171
-18
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -14,16 +14,16 @@ DedupBench is a benchmarking tool for data chunking techniques used in data dedu
1414

1515
It currently supports eleven different chunking algorithms and six different fingerprinting algorithms. It supports SIMD acceleration with [VectorCDC](https://www.usenix.org/conference/fast25/presentation/udayashankar) for these algorithms using five different vector instruction sets on Intel, AMD, ARM, and IBM CPUs.
1616

17-
The following chunking techniques and SIMD accelerations are currently supported by DedupBench.
17+
The following chunking techniques and vector instruction sets are currently supported by DedupBench.
1818

1919
| CDC Algorithm | Link | Unaccelerated | SSE-128 | AVX-256 | AVX-512 | NEON-128 (ARM) | VSX-128 (IBM) |
2020
| :-------: | :-------: | :-------: | :-------: | :-------: | :-------: | :-------: | :-------: |
2121
| AE-Max | [Paper](https://ieeexplore.ieee.org/document/7218510) | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
2222
| AE-Min | [Paper](https://ieeexplore.ieee.org/document/7218510) | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
23-
| CRC-32 | [GitHub](https://github.com/google/crc32c) | ✔️ ||||||
23+
| CRC-32 | [Paper](https://dl.acm.org/doi/10.1145/3319647.3325834) | ✔️ ||||||
2424
| FastCDC | [Paper](https://www.usenix.org/conference/atc16/technical-sessions/presentation/xia) | ✔️ ||||||
2525
| Fixed-size | [Paper](https://www.usenix.org/conference/fast-02/venti-new-approach-archival-data-storage) | ✔️ ||||||
26-
| Gear | [Paper](https://www.sciencedirect.com/science/article/pii/S0166531614000790) | ✔️ ||||||
26+
| Gear | [Paper](https://dl.acm.org/doi/10.1145/3319647.3325834) | ✔️ ||||||
2727
| MAXP| [Paper](https://www.sciencedirect.com/science/article/pii/S0022000009000580) | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
2828
| Rabin | [Paper](https://dl.acm.org/doi/abs/10.1145/502034.502052) | ✔️ ||||||
2929
| RAM | [Paper](https://www.sciencedirect.com/science/article/abs/pii/S0167739X16305829) | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
@@ -39,7 +39,7 @@ The following chunking techniques and SIMD accelerations are currently supported
3939
# 🚀 Quick start guide
4040
To quickly get started, run the following commands on Ubuntu:
4141

42-
1. Clone repository and create a basic build without SIMD acceleration.
42+
1. Clone repository and choose a basic build without SIMD acceleration.
4343
```
4444
git clone [email protected]:UWASL/dedup-bench.git
4545
cd dedup-bench/
@@ -63,7 +63,7 @@ To see a real dataset in action and generate the graph below, download and use t
6363

6464
# ⚡ DedupBench SIMD Builds
6565

66-
To use any of the vector-accelerated CDC algorithms, an alternative Dedupbench build is required. We have provided preconfigured files for all algorithms with 8KB chunk sizes for convenience.
66+
To use any of the vector-accelerated CDC algorithms, an alternative Dedupbench build is required. Choose the right SIMD build when running `install.sh` or use the manual commands commands below. We have provided preconfigured files for all algorithms with 8KB chunk sizes for convenience.
6767

6868
**_Note that building with the wrong options (such as AVX-256 on a machine without AVX-256 support) may result in compile / runtime errors._**
6969

build/Makefile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ avx512:
4747

4848
.PHONY: arm_neon128
4949
arm_neon128:
50-
$(MAKE) all
50+
$(MAKE) EXTRA_COMPILER_FLAGS="-mfpu=neon" all
5151

5252
.PHONY: ibm_altivec128
5353
ibm_altivec128:
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
# General Parameters
2+
chunking_algo=ae
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/aemax_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# AE Parameters
10+
ae_extreme_mode=max
11+
ae_avg_block_size=16384
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
# General Parameters
2+
chunking_algo=fastcdc
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/fastcdc_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# FastCDC Parameters
10+
fastcdc_min_block_size=8192
11+
fastcdc_avg_block_size=16384
12+
fastcdc_max_block_size=32768
13+
fastcdc_normalization_level=2
14+
fastcdc_disable_normalization=false
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
# General Parameters
2+
chunking_algo=rabins
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/rabins_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# Rabin Chunking Parameters
10+
rabinc_min_block_size=8192
11+
rabinc_avg_block_size=16384
12+
rabinc_max_block_size=32768
13+
rabinc_window_size=48
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
# General Parameters
2+
chunking_algo=ram
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/ram_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# RAM Parameters
10+
ram_max_block_size=32768
11+
ram_avg_block_size=16384
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
# General Parameters
2+
chunking_algo=seq
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/seqcdc_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# SeqCDC Parameters
10+
seq_op_mode=inc
11+
12+
seq_jump_trigger=50
13+
seq_sequence_threshold=5
14+
seq_jump_size=512
15+
16+
seq_min_block_size=8192
17+
seq_avg_block_size=16384
18+
seq_max_block_size=32768
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
# General Parameters
2+
chunking_algo=tttd
3+
hashing_algo=sha1
4+
output_file=./hashes_16kb_middleware24/tttd_16kb.out
5+
simd_mode=none
6+
7+
buffer_size=32768
8+
9+
# TTTD Parameters
10+
tttd_min_block_size=8192
11+
tttd_avg_block_size=16384
12+
tttd_max_block_size=32768
13+
14+
# Rabin Chunking Parameters
15+
rabinc_min_block_size=8192
16+
rabinc_avg_block_size=16384
17+
rabinc_max_block_size=32768
18+
rabinc_window_size=48

build/config_simd512_8kb/maxp_8kb.conf

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,5 +7,5 @@ simd_mode=none
77
buffer_size=32768
88

99
# MAXP Parameters
10-
maxp_window_size = 960
10+
maxp_window_size = 1024
1111
maxp_max_block_size = 32768

build/config_simd512_8kb/vmaxp512_8kb.conf

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,5 +7,5 @@ simd_mode=avx512
77
buffer_size=32768
88

99
# MAXP Parameters
10-
maxp_window_size = 960
10+
maxp_window_size = 1024
1111
maxp_max_block_size = 32768

0 commit comments

Comments
 (0)