Suggest an alternative to

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Why do you think that https://github.com/Pythagora-io/gpt-pilot is a good alternative to LLMTest_NeedleInAHaystack