Stanford NLP Python library for Representation Finetuning (ReFT)
This is an open-source version of the representation engineering framework for stopping harmful outputs or hallucinations at the level of activations. It is free to use and self-hosted.