
Research Scientist, Interpretability

Anthropic
Posted 3 April 2026

About the Role

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

When you see what modern language models are capable of, do you wonder, "How do these things work? How can we trust them?"

The Interpretability team at Anthropic is working to reverse-engineer how trained models work, because we believe that a mechanistic understanding is the most robust way to make advanced systems safe. We're looking for researchers and engineers to join our efforts.

People mean many different things by "interpretability". We're focused on mechanistic interpretability, which aims to discover how neural network parameters map to meaningful algorithms. Some useful analogies are to think of us as doing "biology" or "neuroscience" of neural networks using "microscopes" we build, or as treating neural networks as binary computer programs we're trying to reverse-engineer.

A few places to learn more about our work and team at a high level are this introduction to Interpretability (https://www.youtube.com/watch?v=TxhhMTOTMDg) from our research lead, Chris Olah (https://colah.github.io/about.html); a discussion of our work (https://open.spotify.com/episode/5UF79Uu94ia0fwC32a89LU) on the New York Times' Hard Fork podcast (https://www.nytimes.com/column/hard-fork); and this blog post (https://www.anthropic.com/research/engineering-challenges-interpretability), with an accompanying video, on some of the engineering challenges we had to solve to get these results. Some of our team's notable publications include A Mathematical Framework for Transformer Circuits (https://transformer-circuits.pub/2021/framework/index.html), In-context Learning and Induction Heads (https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html), Toy Models of Superposition (https://transformer-circuits.pub/2022/toy_model/index.html), Scaling Monosemanticity (https://transformer-circuits.pub/2024/scaling-monosemanticity/), and our circuits Methods (https://transformer-circuits.pub/2025/attribution-graphs/methods.html) and Biology (https://transformer-circuits.pub/2025/attribution-graphs/biology.html) papers. This work builds on members' research prior to Anthropic, such as the original circuits thread (https://distill.pub/2020/circuits/), Multimodal Neurons (https://distill.pub/2021/multimodal-neurons/), Activation Atlases (https://distill.pub/2019/activation-atlas/), and Building Blocks (https://distill.pub/2018/building-blocks/).

We aim to create a solid foundation for mechanistically understanding neural networks and making them safe (see our …)
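For a concrete flavor of the dictionary-learning direction behind papers like Toy Models of Superposition and Scaling Monosemanticity, here is a minimal sparse-autoencoder sketch in PyTorch. The architecture, shapes, and hyperparameters are illustrative assumptions for this posting, not the team's actual setup.

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Toy dictionary learner: maps d_model-dim activations into an
    overcomplete set of n_features sparse features and back."""
    def __init__(self, d_model: int = 512, n_features: int = 4096):
        super().__init__()
        self.encoder = nn.Linear(d_model, n_features)
        self.decoder = nn.Linear(n_features, d_model)

    def forward(self, x: torch.Tensor):
        f = torch.relu(self.encoder(x))   # sparse feature activations
        x_hat = self.decoder(f)           # reconstruction of the input
        return x_hat, f

# One training step: reconstruct the activations while an L1 penalty
# pushes most features to zero, so each surviving feature fires on a
# narrow, hopefully interpretable, pattern in the underlying model.
model = SparseAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
l1_coeff = 1e-3  # sparsity strength (illustrative value)

acts = torch.randn(64, 512)  # stand-in for residual-stream activations
x_hat, f = model(acts)
loss = ((x_hat - acts) ** 2).mean() + l1_coeff * f.abs().mean()
opt.zero_grad()
loss.backward()
opt.step()
```

In the cited work, activations come from a real language model rather than random tensors, and the learned features are then inspected for human-interpretable meaning; the sketch only shows the basic reconstruction-plus-sparsity objective.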
