Generative AI Post navigation Building better AI benchmarks: How many raters are enough? Improving the academic workflow: Introducing two AI agents for better figures and peer review