BERT4ETH (PyTorch Version)

This is the PyTorch implementation for the paper BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection, accepted by the ACM Web conference (WWW) 2023.

The implementation of fine-tuning on phishing data was finished by Qiustander (https://github.com/Qiustander). Thanks a lot!

If you find this repository useful, please give us a star and cite our paper : ) Thank you!

Getting Start

Requirements

PyTorch > 1.12.0

Preprocess dataset

Step 1: Download dataset from Google Drive.

Step 2: Unzip dataset under the directory of "BERT4ETH/Data/"

cd BERT4ETH_PyTorch/data; # Labels are already included unzip ...;

Pre-training

Step 1: Transaction Sequence Generation

cd src; python gen_seq.py --bizdate=bert4eth_exp

Step 2: Pre-train BERT4ETH

python run_pretrain.py --bizdate="bert4eth_exp" \ --ckpt_dir="bert4eth_exp"

Step 3: Output Representation

python run_embed.py --bizdate="bert4eth_exp" \ --init_checkpoint="bert4eth_exp/xxx.pth"

Evaluation

Phishing Account Detection

cd eval python phish_detection_mlp.py --input_dir="../outputs/xxx"

De-anonymization (ENS dataset)

python run_dean_ENS.py --metric=euclidean \ --init_checkpoint=bert4eth_exp/model_104000

Fine-tuning for phishing account detection

 Will update later..

Citation

@inproceedings{hu2023bert4eth, title={BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection}, author={Hu, Sihao and Zhang, Zhen and Luo, Bingqiao and Lu, Shengliang and He, Bingsheng and Liu, Ling}, booktitle={Proceedings of the ACM Web Conference 2023}, pages={2189--2197}, year={2023} }

Q&A

If you have any questions, you can either open an issue or contact me (sihaohu@gatech.edu), and I will reply as soon as I see the issue or email.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
downstream		downstream
models		models
pretrain		pretrain
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
config.py		config.py
run_script		run_script
utils.py		utils.py
vocab.py		vocab.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BERT4ETH (PyTorch Version)

Getting Start

Requirements

Preprocess dataset

Step 1: Download dataset from Google Drive.

Step 2: Unzip dataset under the directory of "BERT4ETH/Data/"

Pre-training

Step 1: Transaction Sequence Generation

Step 2: Pre-train BERT4ETH

Step 3: Output Representation

Evaluation

Phishing Account Detection

De-anonymization (ENS dataset)

Fine-tuning for phishing account detection

Citation

Q&A

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BERT4ETH (PyTorch Version)

Getting Start

Requirements

Preprocess dataset

Step 1: Download dataset from Google Drive.

Step 2: Unzip dataset under the directory of "BERT4ETH/Data/"

Pre-training

Step 1: Transaction Sequence Generation

Step 2: Pre-train BERT4ETH

Step 3: Output Representation

Evaluation

Phishing Account Detection

De-anonymization (ENS dataset)

Fine-tuning for phishing account detection

Citation

Q&A

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages