Skip to content

Commit 6d6fa78

Browse files
upd readme and gini result
1 parent f162599 commit 6d6fa78

File tree

3 files changed

+16
-7
lines changed

3 files changed

+16
-7
lines changed

README.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -45,24 +45,22 @@ Python (3.7.0 or higher, and [Anaconda](https://www.anaconda.com/) recommended)
4545

4646
- Specify the result summary log file using `-o FILE_NAME`; Export an additional result summary CSV `FILE_NAME_prefix.csv` (with success and timing statistics) using `-a`; The summary is only available when running multiple instances (directory mode or file list mode)
4747

48-
- Start testing from the file index k in the folder `-s K` (`K` is the index starting from zero)
48+
- Start solving from the file index `K` in the folder `-s K` (`K` is the index starting from zero)
4949

5050
- If you want to run multiple instances, make sure to use different `FILE_NAME`-s in the config file to avoid clash (`config.yml` in default)
5151

5252
- More options see `--help`
5353

5454
- After finishing running, the `./tmp` directory can be deleted safely
5555

56-
# To reproduce Chronosymbolic-single
56+
# To reproduce the major result: Chronosymbolic-single
5757

58-
Please refer to the configuration in `./experiment/result_summary.log`. Using the default config in `config.yml` should also be decent. Even fixed random seeds can cause minor randomness that may slightly affect the performance.
58+
Please refer to the configuration in `./experiment/result_summary.log` and `./experiment/README.md` (where settings for other minor experiments are also provided). Using the default config in `config.yml` should also be decent. Even fixed random seeds can cause minor randomness that may slightly affect the performance.
5959

6060
- `python test.py -f tests/safe -a -r -v -t 360 -o result/result_safe.log`
6161

6262
- `python test.py -f tests/unsafe -a -r -v -t 360 -o result/result_unsafe.log`
6363

64-
- `python test.py -f tests/multiple_pred -a -r -v -t 360 -o result/result_multi.log`
65-
6664

6765
# To run the baselines
6866
## Spacer and GSpacer

experiment/README.md

Lines changed: 13 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,18 @@ The specifications of the device used to generate this log:
2121

2222

2323
## comparison.xlsx
24-
Detailed running time data on our major performance evaluation in the experiment section.
24+
Detailed running time data on our major performance evaluation in the experiment section of our paper.
2525

2626
## result_rnd_seed.xlsx
27-
Detailed running time data on our performance evaluation with different random seeds is described in our Appendix.
27+
Detailed running time data on our performance evaluation with different random seeds is described in the Appendix of the paper. We only show safe instances as an example. In `result_rnd_seed_gini.xlsx`, the only difference is to use Gini impurity instead of Shannon Entropy in DTs.
28+
29+
## To reproduce Chronosymbolic-cover
30+
Unfortunately due to the incompleteness of our logging system then, the hyperparameters of the 13 runs are not fully recorded. We provide essential experiments needed to run to reproduce the result:
31+
32+
1. Different strategies on scheduling the candidate hypothesis in Table 1 of our paper (e.g., tuning the hyperparameters in `SafeZoneUsage: '(self.total_iter // 200) % 2 == 0', UnsafeZoneUsage: '(self.total_iter // 100) % 2 == 0`);
33+
2. Using different DT settings (may try random DT as well that may not work well overall but works on some specific instances) and Agents (set in `./test.py`);
34+
3. Different dataset settings (enable the queue mode or not, how many samples should the datasets keep);
35+
4. Different expansion strategy for the reasoner (`Expansion` in `./config.yml`).
36+
37+
## To reproduce results of CHC-COMP-22-LIA
38+
The default settings in `./config.yml` should be decent (might be a little bit worse than the best result but within 5%). Note that, if the timeout malfunctions for some instances, interrupt the tool manually and use the `-s K` option starting from the file index `K` in the folder.
49.1 KB
Binary file not shown.

0 commit comments

Comments
 (0)