Skip to content

Commit cf7ab8c

Browse files
committed
refactoring readme
1 parent cfd288f commit cf7ab8c

File tree

1 file changed

+10
-5
lines changed

1 file changed

+10
-5
lines changed

README.md

Lines changed: 10 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,18 @@
11
# NextCoder
22

33
<p align="center">
4-
🤗 <a href="https://huggingface.co/microsoft">Hugging Face</a>&nbsp&nbsp | &nbsp&nbsp 📑 <a href="https://arxiv.org/#">Paper</a>
4+
🤗 <a href="https://huggingface.co/microsoft">Hugging Face</a>&nbsp&nbsp | &nbsp&nbsp 📑 <a href="https://arxiv.org/abs/2503.03656">Paper</a>
55
</p>
66

7-
## Assets (this is temporary section and will be removed)
8-
- Only new synthetic dataset
9-
- Models and usage
10-
- Training reciepe
7+
## Introduction
8+
This repository hosts the official code and data artifact for the paper [NextCoder: Robust Learning of Diverse Code Edits
9+
](https://arxiv.org/abs/2503.03656)
10+
11+
The work is the development of code-editing LLMs, synthetic data generation pipeline and a novel finetuning methodology.
12+
13+
## Repository Structure
14+
- [data](data/): contains the scripts and files required to generate synthetic dataset for code-editing as per the pipeline proposed in the paper
15+
- [train](src/train/) contains the scripts and files for finetuning with SeleKT, LoRA and usual SFT.
1116

1217
## Model Usage
1318
```python

0 commit comments

Comments
 (0)