Evaluating AI's general performance & finding models for unrestricted prompting remains a challenge