Exploring Polysemanticity and Superposition
Confusions to study in Toy Models of Superposition
4.9
Adapt their ReLU output model to have a different range of feature values, and see how this affects things. Make the features uniform [0.5, 1]
April 30, 2023; Kunvar(firstuserhere)