gap text2sql下載 - gap text2sql源代碼下載

gap text2sql

Ai源碼

1.0.0

下載

GAP-TEXT2SQL：學習語義解析的上下文表示，並通過一代增強的預訓練

我們AAAI 2021論文的代碼和模型

更新

[2020/02/05]支持在自己的數據庫和查詢上運行模型。查看筆記本。

抽象的

最近，通過利用大規模文本語料庫來培訓具有自我監督的學習目標（例如蒙版語言模型（MLM））的大規模神經語言模型，對學習各種NLP任務的上下文表示有重大興趣。但是，基於一項試點研究，當將它們應用於文本到SQL語義解析器時，我們會觀察到現有通用語言模型的三個問題：無法檢測到列表中的列提及，無法從單元格值中推斷列提及，並且無法構成複雜的SQL查詢。為了減輕這些問題，我們提出了一個模型預訓練框架，即生成增強的預訓練（GAP），該框架共同學習了通過利用生成模型生成預訓練數據的自然語言話語和表格模式的表示。 GAP模型經過2m tustrance-Schema對和30k Tusterance-Schema-Sql三元組的訓練，其發音是由生成模型產生的。基於實驗結果，利用差距模型作為表示編碼器的神經語義解析器獲得了蜘蛛和標準至SQL基準的新最新結果。

設定

conda create --name gap-text2sql python=3.7
source activate gap-text2sql
conda install pytorch=1.5 cudatoolkit=10.2 -c pytorch
pip install -r requirements.txt
python -c " import nltk; nltk.download('stopwords'); nltk.download('punkt') "

下載數據集

pip install gdown
cd rat-sql-gap
gdown --id 1_AckYkinAnhqmRQtGsQgUKAnTHxxX5J0
unzip spider.zip
bash data/spider/generate.sh ./spider

構建數據集目錄

mkdir data/spider-bart
cp ./spider/tables.json data/spider-bart/
cp ./spider/train_spider.json data/spider-bart/
cp ./spider/train_others.json data/spider-bart/
cp ./spider/dev.json data/spider-bart/
ln -s $( pwd ) /spider/database data/spider-bart/database

下載圖書館

mkdir third_party
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2018-10-05.zip
unzip stanford-corenlp-full-2018-10-05.zip -d third_party/

啟動斯坦福圖書館

 pushd third_party/stanford-corenlp-full-2018-10-05
nohup java -mx4g -cp " * " edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 8999 -timeout 15000 > server.log &
popd

下載檢查站

mkdir -p logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/
mkdir ie_dirs
aws s3 cp s3://gap-text2sql-public/checkpoint-artifacts/gap-finetuned-checkpoint logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/model_checkpoint-00041000

mkdir -p pretrained_checkpoint
aws s3 cp s3://gap-text2sql-public/checkpoint-artifacts/pretrained-checkpoint pretrained_checkpoint/pytorch_model.bin

另外，如果您沒有AWSCLI：GAP-FINETNETED-CHACKPOINT和審計的檢查點，則可以在此處下載它們

curl https://gap-text2sql-public.s3.amazonaws.com/checkpoint-artifacts/gap-finetuned-checkpoint -o logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/model_checkpoint-00041000
curl https://gap-text2sql-public.s3.amazonaws.com/checkpoint-artifacts/pretrained-checkpoint -o pretrained_checkpoint/pytorch_model.bin

預處理數據集

python run.py preprocess experiments/spider-configs/gap-run.jsonnet

推理

python run.py eval experiments/spider-configs/gap-run.jsonnet

然後，您可以在路徑中獲得推理結果和評估結果： ie_dirs/bart_run_1_true_1-step41000.infer和ie_dirs/bart_run_1_true_1-step41000.eval 。

訓練

python run.py train experiments/spider-configs/gap-run.jsonnet

安全

有關更多信息，請參見貢獻。

執照

該項目已根據APACHE-2.0許可獲得許可。

展開

附加信息

版本 1.0.0
類型 Ai源碼
更新時間 2025-09-10
大小 249.7KB
來自於 Github

相關應用

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub actions/download artifact

2024-11-01

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
ML stack

Ai源碼

1.0.0
awesome free chatgpt

Ai源碼

1.0.0
pywin_contextmenu

Ai源碼

Version update
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部