gap text2sql下载 - gap text2sql源代码下载

gap text2sql

Ai源码

1.0.0

下载

GAP-TEXT2SQL：学习语义解析的上下文表示，并通过一代增强的预训练

我们AAAI 2021论文的代码和模型

更新

[2020/02/05]支持在自己的数据库和查询上运行模型。查看笔记本。

抽象的

最近，通过利用大规模文本语料库来培训具有自我监督的学习目标（例如蒙版语言模型（MLM））的大规模神经语言模型，对学习各种NLP任务的上下文表示有重大兴趣。但是，基于一项试点研究，当将它们应用于文本到SQL语义解析器时，我们会观察到现有通用语言模型的三个问题：无法检测到列表中的列提及，无法从单元格值中推断列提及，并且无法构成复杂的SQL查询。为了减轻这些问题，我们提出了一个模型预训练框架，即生成增强的预训练（GAP），该框架共同学习了通过利用生成模型生成预训练数据的自然语言话语和表格模式的表示。 GAP模型经过2m tustrance-Schema对和30k Tusterance-Schema-Sql三元组的训练，其发音是由生成模型产生的。基于实验结果，利用差距模型作为表示编码器的神经语义解析器获得了蜘蛛和标准至SQL基准的新最新结果。

设置

conda create --name gap-text2sql python=3.7
source activate gap-text2sql
conda install pytorch=1.5 cudatoolkit=10.2 -c pytorch
pip install -r requirements.txt
python -c " import nltk; nltk.download('stopwords'); nltk.download('punkt') "

下载数据集

pip install gdown
cd rat-sql-gap
gdown --id 1_AckYkinAnhqmRQtGsQgUKAnTHxxX5J0
unzip spider.zip
bash data/spider/generate.sh ./spider

构建数据集目录

mkdir data/spider-bart
cp ./spider/tables.json data/spider-bart/
cp ./spider/train_spider.json data/spider-bart/
cp ./spider/train_others.json data/spider-bart/
cp ./spider/dev.json data/spider-bart/
ln -s $( pwd ) /spider/database data/spider-bart/database

下载图书馆

mkdir third_party
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2018-10-05.zip
unzip stanford-corenlp-full-2018-10-05.zip -d third_party/

启动斯坦福图书馆

 pushd third_party/stanford-corenlp-full-2018-10-05
nohup java -mx4g -cp " * " edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 8999 -timeout 15000 > server.log &
popd

下载检查站

mkdir -p logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/
mkdir ie_dirs
aws s3 cp s3://gap-text2sql-public/checkpoint-artifacts/gap-finetuned-checkpoint logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/model_checkpoint-00041000

mkdir -p pretrained_checkpoint
aws s3 cp s3://gap-text2sql-public/checkpoint-artifacts/pretrained-checkpoint pretrained_checkpoint/pytorch_model.bin

另外，如果您没有AWSCLI：GAP-FINETNETED-CHACKPOINT和审计的检查点，则可以在此处下载它们

curl https://gap-text2sql-public.s3.amazonaws.com/checkpoint-artifacts/gap-finetuned-checkpoint -o logdir/bart_run_1/bs = 12 , lr = 1.0e-04 , bert_lr = 1.0e-05 , end_lr = 0e0 , att = 1/model_checkpoint-00041000
curl https://gap-text2sql-public.s3.amazonaws.com/checkpoint-artifacts/pretrained-checkpoint -o pretrained_checkpoint/pytorch_model.bin

预处理数据集

python run.py preprocess experiments/spider-configs/gap-run.jsonnet

推理

python run.py eval experiments/spider-configs/gap-run.jsonnet

然后，您可以在路径中获得推理结果和评估结果： ie_dirs/bart_run_1_true_1-step41000.infer和ie_dirs/bart_run_1_true_1-step41000.eval 。

训练

python run.py train experiments/spider-configs/gap-run.jsonnet

安全

有关更多信息，请参见贡献。

执照

该项目已根据APACHE-2.0许可获得许可。

展开

附加信息

版本 1.0.0
类型 Ai源码
更新时间 2025-09-10
大小 249.7KB
来自于 Github

gap text2sql

GAP-TEXT2SQL：学习语义解析的上下文表示，并通过一代增强的预训练

更新

抽象的

设置

下载数据集

构建数据集目录

下载图书馆

启动斯坦福图书馆

下载检查站

预处理数据集

推理

训练

安全

执照

GitHub sgrebnov/cordova plugin background download

Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

GitHub actions/download artifact

chat.petals.dev

GPT Prompt Templates

GPTyped

ML stack

awesome free chatgpt

pywin_contextmenu

Google Dorks

shepherd

mongo express