Automated testing framework for evaluating LLM long-context capabilities using the Harry Potter novel series.