PerceiverIO_Pytorch ดาวน์โหลด - PerceiverIO_Pytorch ซอร์สโค้ดดาวน์โหลดดาวน์โหลด

PerceiverIO_Pytorch

โค้ดแหล่งที่มา AI

1.0.0

ดาวน์โหลด

Perceiverio Pytorch

การปรับรูปแบบการรับรู้ของ DeepMind (https://arxiv.org/abs/2103.03206) เป็น Pytorch รหัส JAX/HAIKU ดั้งเดิมสามารถดูได้ที่นี่: https://github.com/deepmind/deepmind-research/tree/master/perceiver

การติดตั้ง

โคลนที่เก็บ:

git clone https://github.com/JOBR0/PerceiverIO_Pytorch
cd PerceiverIO_Pytorch

สร้างสภาพแวดล้อมเสมือนจริงและเปิดใช้งาน:

python3 -m venv perceiverEnv
source perceiverEnv/bin/activate

ติดตั้ง pytorch ตามคำแนะนำอย่างเป็นทางการ: https://pytorch.org/get-started/locally/
ติดตั้งแพ็คเกจอื่น ๆ ที่จำเป็นจาก chendreet.txt:

pip3 install -r requirements.txt

ตัวอย่าง

การใช้งานครอบคลุมงานตัวอย่างต่อไปนี้ซึ่งมีโมเดลที่มีการปรับแต่ง:

การสร้างแบบจำลองภาษาที่สวมหน้ากาก (ตัวอย่าง _language.py)
การจำแนกรูปภาพ (ตัวอย่าง _img_classify.py)
การเข้ารหัสอัตโนมัติวิดีโอหลายโมดอล (example_multimodal.py)
การประมาณค่าการไหลของแสง (ตัวอย่าง _opt_flow.py)

นางแบบที่ได้รับการฝึกฝน

จุดตรวจไฮกุจากที่เก็บ DeepMind อย่างเป็นทางการได้รับการแปลงเป็นจุดตรวจ Pytorch และสามารถดาวน์โหลดได้จาก Google-Drive ควรวางจุดตรวจ Pytorch ในโฟลเดอร์ 'pytorch_checkpoints' เพื่อให้รหัสตัวอย่างสามารถค้นหาได้

การใช้งาน

ในการสร้าง preceeiverio ใหม่สำหรับงานที่กำหนดเองคลาส Perceiver ใน Perceiver_io/Perceiver.py ใช้

 class PerceiverIO ( nn . Module ):
    """The Perceiver: a scalable, fully attentional architecture.
    Args:
        num_blocks (int): Number of times the block is applied with shared weights. Default: 8
        num_self_attends_per_block (int): Number of self-attentions in the block. Default: 6,
        num_latents: (int): Number of latent vectors. Default 512,
        num_latent_channels (int): Number of channels for the latent vectors. Default: 1024,
        final_project (bool): Whether to apply a linear layer to the outputs before the post-processors. Default: True,
        final_project_out_channels (int): Number of output channels for the final projection layer. Default: None,
        perceiver_encoder_kwargs (Dict): Additional arguments for the perceiver encoder class. Default: {},
        perceiver_decoder_kwargs (Dict): Additional arguments for the perceiver decoder class. Default: {},
        input_preprocessors (dict / nn.Module): Optional input preprocessors. 1 or none for each modality. Default: None,
        output_postprocessors (dict / nn.Module): Optional output postprocessors. 1 or none for each modality. Default: None,
        output_queries (dict / nn.Module): Modules that create the output queries. 1 for each modality. Default: None,
        output_query_padding_channels (int): Number of learnable features channels that are added to the output queries. Default: 0,
        input_padding_channels (int): Number of learnable features channels that are added to the preprocessed inputs. Default: 0,
        input_channels (dict, int): = The number of input channels need to be specified if NO preprocessor is used. Otherwise,
                                    the number will be inferred from the preprocessor. Default: None,
        input_mask_probs (dict): Probability with which each input modality will be masked out. Default None,
    """

ต่อไปนี้เป็นไดอะแกรมของ Perceiverio สำหรับแอปพลิเคชันหลายรูปแบบ:

อินพุต preprocessors (ไม่บังคับ)

ตัวประมวลผลล่วงหน้าอินพุตใช้ข้อมูลอินพุตดิบและประมวลผลล่วงหน้าเพื่อให้สามารถสอบถามได้โดยการแทรกแซงข้ามครั้งแรก นี่อาจเป็นเช่นการสร้างแพตช์จากภาพ โดยปกติแล้วการเข้ารหัสตำแหน่งจะถูกรวมเข้าด้วยกันโดยตัวประมวลผลล่วงหน้า แทนที่จะใช้ตัวประมวลผลล่วงหน้าอินพุตสามารถประมวลผลด้วยตนเองได้

input_preprocessors หลายตัวสามารถพบได้ใน perceiver_io/io_processors/preprocessors.py

เอาท์พุท postprocessors (ไม่บังคับ)

เอาท์พุท postprocessors ใช้ผลลัพธ์สุดท้ายของการรับรู้และประมวลผลเพื่อให้ได้รูปแบบเอาต์พุตที่ต้องการ

Output_postprocessors หลายตัวสามารถพบได้ใน perceiver_io/io_processors/postprocessors.py

แบบสอบถามเอาท์พุท

Queries ouput สร้างคุณสมบัติที่ใช้ในการสืบค้นการแสดงแฝงครั้งสุดท้ายของการรับรู้เพื่อสร้างผลลัพธ์ พวกเขาได้รับอินพุตที่ประมวลผลล่วงหน้าเป็นอาร์กิวเมนต์เพื่อให้พวกเขาสามารถใช้งานได้หากต้องการ พวกเขามักจะรวมการเข้ารหัสตำแหน่ง

output_queries หลายสามารถพบได้ใน perceiver_io/output_queries.py ### หลาย modalities เพื่อประมวลผลหลาย modalities ในครั้งเดียวพจนานุกรมที่มีการแมปจากโมเดลไปยังโมดูลสามารถใช้สำหรับตัวประมวลผลอินพุต _preprocessors, output_postprocessors เพื่อให้อินพุตที่แตกต่างกันเข้ากันได้ซึ่งกันและกันพวกเขาจะถูกเบาะเป็นขนาดช่องสัญญาณเดียวกันกับพารามิเตอร์ที่สามารถฝึกอบรมได้ นอกจากนี้ยังเป็นไปได้ที่จะใช้จำนวนการสืบค้นเอาต์พุตที่แตกต่างจากอินพุต

การอ้างอิง

 @misc { jaegle2021perceiver ,
    title   = { Perceiver IO: A General Architecture for Structured Inputs & Outputs } ,
    author  = { Andrew Jaegle and Sebastian Borgeaud and Jean-Baptiste Alayrac and Carl Doersch and Catalin Ionescu and David Ding and Skanda Koppula and Andrew Brock and Evan Shelhamer and Olivier Hénaff and Matthew M. Botvinick and Andrew Zisserman and Oriol Vinyals and João Carreira } ,
    year    = { 2021 } ,
    eprint  = { 2107.14795 } ,
    archivePrefix = { arXiv } ,
    primaryClass = { cs.LG }
}

ขยาย

ข้อมูลเพิ่มเติม

เวอร์ชัน 1.0.0
ประเภท โค้ดแหล่งที่มา AI
เวลาอัปเดต 2025-09-09
ขนาด 2.71MB
มาจาก Github

แอปที่เกี่ยวข้อง

OpenCore_NO_ACPI_Build

2024-11-13
nspanel_pro_tools_apk

2024-11-12
zkwork_aleo_gpu_worker

2024-11-11
pytorch image models

2024-11-03
nextcloud_share_url_downloader

2024-11-01
เครื่องมือวิเคราะห์ข้อมูล Lihua เวอร์ชันฟรี 3.0_search_navigation_collection_public comment_ranking_api

2022-06-28

แนะนำสำหรับคุณ

chat.petals.dev

ซอร์สโค้ดอื่น ๆ

1.0.0
GPT Prompt Templates

ซอร์สโค้ดอื่น ๆ

1.0.0
GPTyped

ซอร์สโค้ดอื่น ๆ

GPTyped 1.0.5
ML stack

โค้ดแหล่งที่มา AI

1.0.0
awesome free chatgpt

โค้ดแหล่งที่มา AI

1.0.0
pywin_contextmenu

โค้ดแหล่งที่มา AI

Version update
Google Dorks

ซอร์สโค้ดอื่น ๆ

1.0
shepherd

ซอร์สโค้ดอื่น ๆ

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

ซอร์สโค้ดอื่น ๆ

v1.1.0-rc-3

ข้อมูลที่เกี่ยวข้อง ทั้งหมด