Commit

minor fix

kishi6 committed Nov 15, 2023
1 parent 7b1c373 commit b736356
Showing 4 changed files with 31 additions and 33 deletions.
basicgym/README.md: 24 changes (12 additions & 12 deletions)
@@ -17,9 +17,9 @@

## Overview

*BasicGym* is an open-source simulation platform for synthetic simulation, which is written in Python. The simulator is particularly intended for reinforcement learning algorithms and follows [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface. We design SyntheticGym as a configurative environment so that researchers and practitioners can customize the environmental modules including `StateTransitionFunction` and `RewardFunction`
*BasicGym* is an open-source simulation platform for synthetic simulation, written in Python. The simulator is particularly intended for reinforcement learning algorithms and follows an [OpenAI Gym](https://gym.openai.com)- and [Gymnasium](https://gymnasium.farama.org/)-like interface. We design BasicGym as a configurative environment so that researchers and practitioners can customize the environmental modules, including `StateTransitionFunction` and `RewardFunction`.

Note that SyntheticGym is publicized under [scope-rl](../) repository, which facilitates the implementation of the offline reinforcement learning procedure.
Note that BasicGym is released under the [scope-rl](../) repository, which facilitates the implementation of the offline reinforcement learning procedure.

### Basic Setting

@@ -33,21 +33,21 @@ We formulate the following (Partially Observable) Markov Decision Process ((PO)M

### Implementation

SyntheticGym provides a standardized environment in both discrete and continuous action settings.
BasicGym provides a standardized environment in both discrete and continuous action settings.
- `"BasicEnv-continuous-v0"`: Standard continuous environment.
- `"BasicEnv-discrete-v0"`: Standard discrete environment.

SyntheticGym consists of the following environment.
BasicGym consists of the following environment.
- [BasicEnv](./envs/basic.py#L18): The basic configurative environment.

SyntheticGym is configurative about the following module.
BasicGym is configurative about the following modules.
- [StateTransitionFunction](./envs/simulator/function.py#L14): Class to define the state transition function.
- [RewardFunction](./envs/simulator/function.py#L101): Class to define the reward function.

Note that users can customize the above modules by following the [abstract class](./envs/simulator/base.py).
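As a rough sketch of such a customization, a user-defined state transition function might look like the following. The base-class name, import path, and `step()` signature are assumptions inferred from the module list above, so the exact interface should be checked against [the abstract class](./envs/simulator/base.py).

```Python
from dataclasses import dataclass
from typing import Optional

import numpy as np
from sklearn.utils import check_random_state

# NOTE: the base-class name and import path are assumptions; see ./envs/simulator/base.py
from basicgym import BaseStateTransitionFunction


@dataclass
class CustomizedStateTransitionFunction(BaseStateTransitionFunction):
    state_dim: int
    action_dim: int
    random_state: Optional[int] = None

    def __post_init__(self):
        self.random_ = check_random_state(self.random_state)
        # fixed linear dynamics, drawn once at initialization
        self.state_coef = self.random_.normal(size=(self.state_dim, self.state_dim))
        self.action_coef = self.random_.normal(size=(self.state_dim, self.action_dim))

    def step(self, state: np.ndarray, action: np.ndarray) -> np.ndarray:
        # noisy linear transition: s' = A s + B a + noise
        next_state = self.state_coef @ state + self.action_coef @ action
        return next_state + self.random_.normal(scale=0.1, size=self.state_dim)
```

A customized `RewardFunction` would follow the same pattern, subclassing the corresponding abstract class.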

## Installation
SyntheticGym can be installed as a part of [scope-rl](../) using Python's package manager `pip`.
BasicGym can be installed as a part of [scope-rl](../) using Python's package manager `pip`.
```
pip install scope-rl
```
@@ -64,12 +64,12 @@ python setup.py install
We provide an example usage of the standard and customized environment. \
The online/offline RL and Off-Policy Evaluation examples are provided in [SCOPE-RL's README](../README.md).

### Standard SyntheticEnv
### Standard BasicEnv

Our standard SyntheticEnv is available from `gym.make()`, following the [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface.
Our standard BasicEnv is available from `gym.make()`, following the [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface.

```Python
# import SyntheticGym and gym
# import BasicGym and gym
import basicgym
import gym

@@ -134,9 +134,9 @@ plt.show()
</p>
</figcaption>

Note that while we use [SCOPE-RL](../README.md) and [d3rlpy](https://github.com/takuseno/d3rlpy) here, SyntheticGym is compatible with any other libraries working on the [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface.
Note that while we use [SCOPE-RL](../README.md) and [d3rlpy](https://github.com/takuseno/d3rlpy) here, BasicGym is compatible with any other libraries working on the [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface.
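As a quick illustration of that interface, the standard interaction loop below should work with any library that expects a Gym/Gymnasium-style environment; rather than a full d3rlpy example, it simply rolls out one episode with a uniform random policy. Whether `env.step()` returns four or five values depends on the installed Gym/Gymnasium version, so the five-value unpacking here assumes the newer API.

```Python
import gym
import basicgym  # registers the BasicEnv environment IDs

env = gym.make("BasicEnv-continuous-v0")

obs, info = env.reset()
done = False
total_reward = 0.0

while not done:
    action = env.action_space.sample()  # uniform random policy
    obs, reward, terminated, truncated, info = env.step(action)
    done = terminated or truncated
    total_reward += reward

print(f"episode return: {total_reward:.3f}")
```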

### Customized SyntheticEnv
### Customized BasicEnv

Next, we describe how to customize the environment by instantiating the environment.
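For example, a direct instantiation might look like the sketch below; the constructor arguments shown are illustrative assumptions, so the actual signature should be checked against [BasicEnv](./envs/basic.py#L18).

```Python
from basicgym import BasicEnv

# NOTE: all parameter names below are illustrative assumptions;
# see envs/basic.py for the real constructor signature.
env = BasicEnv(
    state_dim=10,
    action_type="continuous",  # "continuous" or "discrete"
    action_dim=5,
    reward_type="continuous",
    reward_std=0.3,
    obs_std=0.3,
    step_per_episode=10,
    random_state=12345,
)

# Customized modules (e.g., the StateTransitionFunction sketched earlier) would
# presumably be passed to the constructor as well, per the module list above.
```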

@@ -257,7 +257,7 @@ Bibtex:

## Contribution

Any contributions to SyntheticGym are more than welcome!
Any contributions to BasicGym are more than welcome!
Please refer to [CONTRIBUTING.md](../CONTRIBUTING.md) for general guidelines on how to contribute to the project.

## License
@@ -724,7 +724,7 @@
"\n",
"@dataclass\n",
"class CustomizedWinningPriceDistribution(BaseWinningPriceDistribution):\n",
" \"\"\"Initialization.\"\"\"\n",
" \"\"\"初期化.\"\"\"\n",
" n_ads: int\n",
" n_users: int\n",
" ad_feature_dim: int\n",
@@ -746,7 +746,7 @@
" bid_prices: np.ndarray,\n",
" **kwargs,\n",
" ) -> Tuple[np.ndarray]:\n",
" \"\"\"各オークションのインプレッションとセカンドプライスを確率的に決定する..\"\"\"\n",
" \"\"\"各オークションのインプレッションとセカンドプライスを確率的に決定する.\"\"\"\n",
" # 単純な正規分布からの落札価格のサンプリング\n",
" winning_prices = self.random_.normal(\n",
" loc=self.standard_bid_price,\n",
recgym/README.md: 17 changes (8 additions & 9 deletions)
@@ -226,18 +226,17 @@ The statistics of the environment is also visualized at [quickstart/rec/rec_synt

If you use our software in your work, please cite our paper:

Haruka Kiyohara, Kosuke Kawakami, Yuta Saito.<br>
**Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation**<br>
(RecSys'21 SimuRec workshop)<br>
[https://arxiv.org/abs/2109.08331](https://arxiv.org/abs/2109.08331)
Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito.<br>
**SCOPE-RL: A Python Library for Offline Reinforcement Learning, Off-Policy Evaluation, and Policy Selection**<br>
[link]() (a preprint coming soon..)

Bibtex:
```
@article{kiyohara2021accelerating,
title={Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation},
author={Kiyohara, Haruka and Kawakami, Kosuke and Saito, Yuta},
journal={arXiv preprint arXiv:2109.08331},
year={2021}
@article{kiyohara2023towards,
author = {Kiyohara, Haruka and Kishimoto, Ren and Kawakami, Kosuke and Kobayashi, Ken and Nakata, Kazuhide and Saito, Yuta},
title = {SCOPE-RL: A Python Library for Offline Reinforcement Learning, Off-Policy Evaluation, and Policy Selection},
journal={arXiv preprint arXiv:23xx.xxxxx},
year = {2023},
}
```

rtbgym/README.md: 19 changes (9 additions & 10 deletions)
@@ -166,7 +166,7 @@ plt.show()

Note that while we use [SCOPE-RL](../README.md) and [d3rlpy](https://github.com/takuseno/d3rlpy) here, RTBGym is compatible with any other libraries working on the [OpenAI Gym](https://gym.openai.com) and [Gymnasium](https://gymnasium.farama.org/)-like interface.

### Customized RTGEnv
### Customized RTBEnv

Next, we describe how to customize the environment by instantiating the environment.

@@ -359,18 +359,17 @@ Finally, example usages for online/offline RL and OPE/OPS studies are available

If you use our software in your work, please cite our paper:

Haruka Kiyohara, Kosuke Kawakami, Yuta Saito.<br>
**Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation**<br>
(RecSys'21 SimuRec workshop)<br>
[https://arxiv.org/abs/2109.08331](https://arxiv.org/abs/2109.08331)
Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito.<br>
**SCOPE-RL: A Python Library for Offline Reinforcement Learning, Off-Policy Evaluation, and Policy Selection**<br>
[link]() (a preprint coming soon..)

Bibtex:
```
@article{kiyohara2021accelerating,
title={Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation},
author={Kiyohara, Haruka and Kawakami, Kosuke and Saito, Yuta},
journal={arXiv preprint arXiv:2109.08331},
year={2021}
@article{kiyohara2023towards,
author = {Kiyohara, Haruka and Kishimoto, Ren and Kawakami, Kosuke and Kobayashi, Ken and Nakata, Kazuhide and Saito, Yuta},
title = {SCOPE-RL: A Python Library for Offline Reinforcement Learning, Off-Policy Evaluation, and Policy Selection},
journal={arXiv preprint arXiv:23xx.xxxxx},
year = {2023},
}
```

