Alignment Research aims to steer AI systems (e.g. large language models)toward humans’ intended goals, preferences, or ethical principles