Update README.md
- Target Update Frequency: Every 100 episodes
- Number of Episodes: 50

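The target-update cadence listed above can be sketched in plain Python. This is an illustration of the schedule only; `run_schedule` and its bookkeeping are not taken from the repository's `train.py`:

```python
# Illustration of the schedule above: the target network is synced to the
# online network every 100 episodes.
TARGET_UPDATE_FREQUENCY = 100
NUM_EPISODES = 50

def run_schedule(num_episodes, update_every):
    """Return the episode numbers at which a target-network sync occurs."""
    sync_points = []
    for episode in range(1, num_episodes + 1):
        # ... collect experience and take gradient steps here ...
        if episode % update_every == 0:
            sync_points.append(episode)  # target <- online weights
    return sync_points

print(run_schedule(NUM_EPISODES, TARGET_UPDATE_FREQUENCY))  # → []
print(run_schedule(200, TARGET_UPDATE_FREQUENCY))           # → [100, 200]
```

Note that with a 100-episode sync interval and a 50-episode run, no episode-based sync point is reached; how `train.py` actually counts updates may differ.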
## Checkpoints

Checkpoints are saved during training for convenience:

- `checkpoint_11.pth.tar`: After 11 episodes
- `checkpoint_21.pth.tar`: After 21 episodes
- `checkpoint_31.pth.tar`: After 31 episodes
- `checkpoint_41.pth.tar`: After 41 episodes

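A checkpoint like those above can be restored roughly as follows. This is a hedged sketch: the tiny `nn.Sequential` stands in for the actual DQN class, and the dictionary keys (`state_dict`, `episode`) are assumptions about the checkpoint format, not confirmed from the repository:

```python
import torch
import torch.nn as nn

# Stand-in for the DQN; the real architecture must match the one used in
# training for load_state_dict to accept the weights.
def make_net():
    return nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 4))

net = make_net()

# Write a checkpoint here so the sketch is self-contained; normally you
# would load one of the repository's saved checkpoint files instead.
torch.save({"state_dict": net.state_dict(), "episode": 41},
           "checkpoint_41.pth.tar")

# Restore into a freshly constructed network of the same shape.
restored = make_net()
checkpoint = torch.load("checkpoint_41.pth.tar")
restored.load_state_dict(checkpoint["state_dict"])
```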
## Usage
To use this model, load the saved state dictionary and initialize the DQN with the same architecture. The model can then be used to navigate a floorplan and find the most efficient path to the target.

```python
state = ...  # Define your state here
with torch.no_grad():
    action = model(torch.tensor(state, dtype=torch.float32).unsqueeze(0)).argmax().item()
```

## Training Script

The training script `train.py` is included in the repository for those who wish to reproduce the training process or continue training from a specific checkpoint.

### Training Instructions

- Clone the repository.
- Ensure you have the necessary dependencies installed.
- Run the training script:

```bash
python train.py
```

To continue training from a checkpoint, modify the script to load the checkpoint before training.
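Resuming could look roughly like the sketch below. The variable names, checkpoint keys, and the presence of saved optimizer state are all assumptions rather than confirmed details of `train.py`, so adapt them to whatever the script actually saves:

```python
import torch
import torch.nn as nn
import torch.optim as optim

# Stand-ins; substitute the DQN class and optimizer used in train.py.
model = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 4))
optimizer = optim.Adam(model.parameters(), lr=1e-3)

# Create a checkpoint so the sketch runs on its own; in practice, load one
# of the repository's saved files instead.
torch.save({"state_dict": model.state_dict(),
            "optimizer": optimizer.state_dict(),
            "episode": 31}, "checkpoint_31.pth.tar")

checkpoint = torch.load("checkpoint_31.pth.tar")
model.load_state_dict(checkpoint["state_dict"])
optimizer.load_state_dict(checkpoint["optimizer"])
start_episode = checkpoint["episode"] + 1  # continue counting from here
```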
## Evaluation

The model was evaluated based on:
This project leverages the power of reinforcement learning combined with traditional pathfinding algorithms to navigate complex environments efficiently.

## License

This model is licensed under the Apache 2.0 License.
## Citation
If you use this model in your research, please cite it as follows:

author = {Christopher Jones},
title = {Deep Q-Network for Floorplan Navigation},
year = {2024},
howpublished = {\url{https://huggingface.co/cajcodes/dqn-floorplan-navigator}},
note = {Accessed: YYYY-MM-DD}
}
```