abliterate

English dictionary entry

Meanings

verb
  1. To uncensor a large language model by modifying specific model internals to remove refusal behaviours or unwanted traits, while aiming to preserve the model's other capabilities.

Word forms

abliterate, abliterates, abliterating, abliterated

Etymology

Blend of ablate + obliterate. Coined by Redditor /u/FailSpai in early 2024; the idea is to ablate a model's refusal features to the point of obliteration.

This entry uses open data from Wiktionary (CC BY-SA/GFDL).