Language-Specific-Neurons

This repository is the official implementation of our paper "Exploring Language-Specific Regions in Large Language Models".

Language Neurons Found by LAPE

We provide our found language-specific neurons in LLaMA-2 (7B), LLaMA-2 (13B), LLaMA-2 (70B), BLOOM (7B), OPT (6.7B), Mistral (7B), and Phi-2 (2.7B).

You should use torch.load to load xxx.neuron.pth, each of which is a List[List[LongTensor]], neuron[i][j] represents the neuron indice of the i-th language in the j-th layer in the model. The language 0-6 indice stand for en, zh, fr, es, vi, id, ja. For example, LLaMA-2-7B[1][4]=tensor([6147, 9114, 9292]), which means that the Chinese neurons inside the 4-th layer of LLaMA-2-7B are of the indice 6147, 9114, and 9292.

Identifying Language-specific Neurons

Record the activation state:

python activation.py -m meta-llama/Llama-2-7b-hf -l xx

Identifying language-specific neurons:

python identify.py

Computing PPL when Deactivating Neurons

python ppl.py -m meta-llama/Llama-2-7b-hf -a activation_mask/xxx

Open-ended Generation when Deactivating Neurons

python generation.py -m meta-llama/Llama-2-7b-hf -a activation_mask/xxx

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
activations		activations
data/mvicuna		data/mvicuna
BLOOM-7B.neuron.pth		BLOOM-7B.neuron.pth
LLaMA-2-13B.neuron.pth		LLaMA-2-13B.neuron.pth
LLaMA-2-70B.neuron.pth		LLaMA-2-70B.neuron.pth
LLaMA-2-7B.neuron.pth		LLaMA-2-7B.neuron.pth
Mistral-7B.neuron.pth		Mistral-7B.neuron.pth
OPT-6.7B.neuron.pth		OPT-6.7B.neuron.pth
Phi-2-2.7B.neuron.pth		Phi-2-2.7B.neuron.pth
README.md		README.md
activation.py		activation.py
generation.py		generation.py
identify.py		identify.py
ppl.py		ppl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language-Specific-Neurons

Language Neurons Found by LAPE

Identifying Language-specific Neurons

Computing PPL when Deactivating Neurons

Open-ended Generation when Deactivating Neurons

About

Releases

Packages

Languages

RUCAIBox/Language-Specific-Neurons

Folders and files

Latest commit

History

Repository files navigation

Language-Specific-Neurons

Language Neurons Found by LAPE

Identifying Language-specific Neurons

Computing PPL when Deactivating Neurons

Open-ended Generation when Deactivating Neurons

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages