Doing simple retrieval from LLM models at various context lengths to measure accuracy
Why do you think that https://github.com/SillyTavern/SillyTavern is a good alternative to LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Why do you think that https://github.com/SillyTavern/SillyTavern is a good alternative to LLMTest_NeedleInAHaystack