Translated data: The University of California, Berkeley has released Starling-7B, utilizing the RLAIF method to enhance performance through AI feedback, achieving significant progress in both safety and helpfulness. Research indicates that RLAIF primarily improves the model's helpfulness and safety, with relatively minor enhancements to basic capabilities. Future studies may incorporate high-quality human feedback to better meet human needs.