Long Input LLM Test
Framework: anthropic
Running test...