The new tokenizer is interesting, but it definitely is possible to adapt a base ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		aesthesia 2 days ago \| parent \| context \| favorite \| on: Claude Opus 4.7 The new tokenizer is interesting, but it definitely is possible to adapt a base model to a new tokenizer without too much additional training, especially if you're distilling from a model that uses the new tokenizer. (see, e.g., https://openreview.net/pdf?id=DxKP2E0xK2).

		help

ACCount37 2 days ago [–]

Not impossible, but you have to be at least a little bit mad to deploy tokenizer replacement surgery at this scale.

They also changed the image encoder, so I'm thinking "new base model". Whatever base that was powering 4.5/4.6 didn't last long then.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact