9 Commits

Author SHA1 Message Date
Federico Brancasi
c33e376aa0 Allow invoking simulationMain with alternative action input (#4304)
* Allow invoking simulationMain with alternative action input

* Address review comments: rename CLI opts, extract pipeline, fix correctness issues

- Rename CLI options with --train- prefix (--train-input, --train-strategy,
  --train-out, --train-row-offset, --train-worker) and document all options
- Extract runInputPipeline/runInputPipelineParallel to test/pipeline/trainPipeline.ts
- Preserve original row index through parse/replay/prompt pipeline to fix
  sample numbering drift when rows are filtered out
- Fix parseSuggestedEdit: use JSON.parse for escaped text, handle missing delimiter
- Fix line number regex to accept optional space after | (WithoutSpace format)
- Clamp concurrency to >= 1, type samples as ISample[], wrap dispose in try/finally
- Gate verbose logging in loadAndParseInput behind verbose flag
- Use splitLines from existing utility instead of local duplicate

* move nes-datagen to a subcommand

* more code reuse around setting promptStrategy and model config

* Address review: use ResponseFormat, Limiter, assertNever, and raw messages

* minor refactor runPipeline

* finalize

* use POT instead of custom code

* move files from script/ to test/pipeline/

---------

Co-authored-by: ulugbekna <ulugbekna@gmail.com>
2026-03-30 15:44:46 +00:00
Matt Bierner
3c8134184b Enable no-unexternalized-strings in repo (#2448)
Enables the same `no-unexternalized-strings` with have in `vscode` in this repo. This make sure we have a more consistent style across repos and when generating edits
2025-12-05 18:45:12 +00:00
Ulugbek Abdullaev
f082551888 nes: remove old settings to configure model (#1823) 2025-11-06 14:04:18 +00:00
Matt Bierner
87410ce2d0 Update embedding names for simulation/test (#642) 2025-08-18 20:03:50 +00:00
SteVen Batten
e18cf050d9 Support external cache layers (#572)
* support external cache layers

* cleanup

* address feedback
2025-08-18 14:36:23 +00:00
Matt Bierner
21aba584ae Unify embeddings paths (#595)
* Unify embeddings paths

Our code has two embeddings paths for legacy reasons:

- The capi api based one that only supports text3small. This is used everywhere except `#codebase`

- The new github embeddings endpoint based. This one supports additional embedding types and options. However it is currently only used in `#codebase`

This change switches everything to use the new github embeddings endpoint instead

* Fix static deps in tests
2025-08-14 14:34:41 +00:00
Vritant Bhardwaj
f167b31c28 Add ability to specify models through config for simulation tests (#324)
* add ability to specify modelConfig from file

* use custom model in test endpoint provider

* fix model check

* change model config to be independant from specifying model to run

* support yaml

* use readFileSync

* remove yaml parsing

* improve comment

* fix api key

* fix comment

* make opt parsing alig to comment
2025-07-23 17:03:54 +00:00
Logan Ramos
27577393b6 Allow skipping /models cache (#336) 2025-07-22 19:43:42 +00:00
kieferrm
333d9a4053 Hello Copilot
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
2025-06-27 11:35:20 +02:00