semantic segmentation - source code download

semantic segmentation

Python

v0.2.6

Semantic Segmentation

Easy to use and customizable SOTA semantic segmentation models with abundant datasets in PyTorch

A major rework! Stay tuned...

A lot has changed since 2022; nowadays there are even open-world segmentation models (Segment Anything). However, traditional segmentation models are still in demand for high accuracy and custom use cases. This repo will be updated for the new PyTorch version, with updated models and documentation on how to use it with custom datasets, etc.

Expected release date -> May 2024

Planned features:

Rework of the whole training pipeline
Baseline pre-trained models
Newly updated concepts
Easy integration with SOTA backbone models (with tutorials)
Tutorial for custom datasets
Distributed training

Current features to be discarded:

The number of datasets provided will be reduced; instead, there will still be a tutorial for custom datasets
The number of models provided will be reduced, but there will still be valuable tricks and modules that can be easily integrated with any model
Augmentations will be replaced with the official torchvision v2 transforms
Conversion and inference with other frameworks

Features

Applicable to the following tasks:
- Scene Parsing
- Human Parsing
- Face Parsing
- Medical Image Segmentation (coming soon)
20+ datasets
15+ SOTA backbones
10+ SOTA semantic segmentation models
PyTorch, ONNX, TFLite, OpenVINO export & inference

Model Zoo

Supported backbones:

ResNet (CVPR 2016)
ResNetD (ArXiv 2018)
MobileNetV2 (CVPR 2018)
MobileNetV3 (ICCV 2019)
MiT (NeurIPS 2021)
ResT (NeurIPS 2021)
MicroNet (ICCV 2021)
ResNet+ (ArXiv 2021)
PVTv2 (CVMJ 2022)
PoolFormer (CVPR 2022)
ConvNeXt (CVPR 2022)
UniFormer (ArXiv 2022)
VAN (ArXiv 2022)
DaViT (ArXiv 2022)

Supported heads/methods:

FCN (CVPR 2015)
UPerNet (ECCV 2018)
BiSeNetv1 (ECCV 2018)
FPN (CVPR 2019)
SFNet (ECCV 2020)
SegFormer (NeurIPS 2021)
FaPN (ICCV 2021)
CondNet (IEEE SPL 2021)
Light-Ham (ICLR 2021)
Lawin (ArXiv 2022)
TopFormer (CVPR 2022)

Supported standalone models:

BiSeNetv2 (IJCV 2021)
DDRNet (ArXiv 2021)

Supported modules:

PPM (CVPR 2017)
PSA (ArXiv 2021)

Refer to MODELS for benchmarks and available pre-trained models.

Check BACKBONES for the supported backbones.

Note: Most of the methods do not have pre-trained models. It is very difficult to combine different models with pre-trained weights in one repository, and resources to re-train them myself are limited.

Supported Datasets

Scene Parsing:

ADE20K
CityScapes
COCO-Stuff
CamVid
เพรียวบาง
Mapillary Vistas
Sun RGB-D

Human Parsing:

MHPv2
MHPv1
LIP
CCIHP
CIHP
ATR

Face Parsing:

HELEN
LaPa
iBugMask
CelebAMaskHQ
FaceSynthetics

Others:

SUIM

Refer to DATASETS for more details and dataset preparation.

Augmentations Available

See the accompanying notebook to test the augmentation effects.

Pixel-level transforms:

ColorJitter (Brightness, Contrast, Saturation, Hue)
Gamma, Sharpness, AutoContrast, Equalize, Posterize
GaussianBlur, Grayscale

Spatial-level transforms:

Affine, RandomRotation
HorizontalFlip, VerticalFlip
CenterCrop, RandomCrop
Pad, ResizePad, Resize
RandomResizedCrop
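
The split between the two transform families matters for segmentation: a spatial-level transform has to be applied to the image and its label mask together so the labels stay aligned, while a pixel-level transform changes only the image. The following is a minimal sketch of that idea on plain nested lists. The helpers `horizontal_flip`, `adjust_brightness` and `random_augment` are hypothetical names written for illustration and are not the repository's API.

```python
import random


def horizontal_flip(image, mask):
    """Spatial-level: mirror image and mask together so labels stay aligned."""
    return [row[::-1] for row in image], [row[::-1] for row in mask]


def adjust_brightness(image, mask, factor):
    """Pixel-level: scale intensities (clamped to 0..255); the mask is untouched."""
    bright = [[max(0, min(255, int(round(p * factor)))) for p in row] for row in image]
    return bright, mask


def random_augment(image, mask, p_flip=0.5, rng=random):
    """Apply a random flip, then a random brightness change (illustrative ranges)."""
    if rng.random() < p_flip:
        image, mask = horizontal_flip(image, mask)
    return adjust_brightness(image, mask, rng.uniform(0.8, 1.2))


# Tiny grayscale image (2x3) and its per-pixel class labels.
image = [[10, 20, 30], [40, 50, 60]]
mask = [[0, 0, 1], [1, 2, 2]]

flipped_image, flipped_mask = horizontal_flip(image, mask)
bright_image, same_mask = adjust_brightness(image, mask, 2.0)
```

Keeping every transform on the same `(image, mask)` signature lets pixel-level and spatial-level steps be chained in any order without special cases.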

Usage

Installation

python >= 3.6
torch >= 1.8.1
torchvision >= 0.9.1

Then, clone the repo and install the project with:

$ git clone https://github.com/sithu31296/semantic-segmentation
$ cd semantic-segmentation
$ pip install -e .

Configuration

Create a configuration file in configs. A sample configuration for the ADE20K dataset can be found here. Then edit the fields you think are necessary. This configuration file is required by all of the training, evaluation and prediction scripts.

Training

To train with a single GPU:

$ python tools/train.py --cfg configs/CONFIG_FILE.yaml

To train with multiple GPUs, set the DDP field in the config file to true and run as follows:

$ python -m torch.distributed.launch --nproc_per_node=2 --use_env tools/train.py --cfg configs/<CONFIG_FILE_NAME>.yaml
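
With `--use_env`, the launcher passes each worker its local rank through the `LOCAL_RANK` environment variable instead of a `--local_rank` argument, alongside `RANK` and `WORLD_SIZE`. The sketch below shows one way a script could read those variables and fall back to single-process defaults. `DistEnv` and `read_dist_env` are hypothetical names for illustration; how `tools/train.py` actually consumes these values is not shown in this document.

```python
import os
from dataclasses import dataclass


@dataclass
class DistEnv:
    rank: int        # global index of this process
    local_rank: int  # index of this process on its node (usually the GPU id)
    world_size: int  # total number of processes

    @property
    def is_distributed(self) -> bool:
        return self.world_size > 1

    @property
    def is_main_process(self) -> bool:
        # Logging and checkpointing are commonly restricted to rank 0.
        return self.rank == 0


def read_dist_env(env=None) -> DistEnv:
    """Read launcher-provided variables; defaults describe a single-GPU run."""
    env = os.environ if env is None else env
    return DistEnv(
        rank=int(env.get("RANK", 0)),
        local_rank=int(env.get("LOCAL_RANK", 0)),
        world_size=int(env.get("WORLD_SIZE", 1)),
    )


# No launcher variables set: behaves like the single-GPU command.
single = read_dist_env({})
# Second worker of a two-process launch (--nproc_per_node=2).
worker = read_dist_env({"RANK": "1", "LOCAL_RANK": "1", "WORLD_SIZE": "2"})
```

On recent PyTorch releases `torch.distributed.launch` is deprecated in favour of `torchrun`, which sets the same environment variables.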

Evaluation

Make sure to set MODEL_PATH in the configuration file to your trained model directory.

$ python tools/val.py --cfg configs/<CONFIG_FILE_NAME>.yaml

To evaluate with multi-scale and flip, change the ENABLE field in MSF to true and run the same command as above.
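
Multi-scale and flip evaluation generally means running the model on several resized copies of the input, plus their horizontal mirrors, then mapping every output back to the original size and summing the class scores before taking the argmax. The following is a toy sketch of that procedure in plain Python. `toy_model`, `msf_predict` and the scale values are assumptions made for illustration and do not reflect the repository's implementation or its configured scales.

```python
def resize_nearest(grid, out_h, out_w):
    """Nearest-neighbour resize of a 2D list."""
    in_h, in_w = len(grid), len(grid[0])
    return [[grid[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
            for y in range(out_h)]


def hflip(grid):
    return [row[::-1] for row in grid]


def toy_model(image):
    """Stand-in network: two score maps, class 1 scores p and class 0 scores 1 - p."""
    return [[[1.0 - p for p in row] for row in image],
            [[p for p in row] for row in image]]


def msf_predict(model, image, scales=(0.5, 1.0, 2.0), flip=True):
    h, w = len(image), len(image[0])
    total = None  # accumulated per-class scores at the original resolution
    for s in scales:
        scaled = resize_nearest(image, max(1, round(h * s)), max(1, round(w * s)))
        outputs = [model(scaled)]
        if flip:
            # Predict on the mirrored input, then mirror the scores back.
            outputs.append([hflip(m) for m in model(hflip(scaled))])
        for scores in outputs:
            restored = [resize_nearest(m, h, w) for m in scores]
            if total is None:
                total = restored
            else:
                total = [[[a + b for a, b in zip(ra, rb)] for ra, rb in zip(ma, mb)]
                         for ma, mb in zip(total, restored)]
    # Per-pixel argmax over the summed class scores.
    prediction = []
    for y in range(h):
        row = []
        for x in range(w):
            row.append(max(range(len(total)), key=lambda c: total[c][y][x]))
        prediction.append(row)
    return prediction


image = [[0.1, 0.9], [0.8, 0.2]]
prediction = msf_predict(toy_model, image)
```

Each extra scale or flip adds a full forward pass, so this mode trades evaluation time for accuracy.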

Inference

To run inference, edit the following parameters in the config file:

Change MODEL >> NAME and BACKBONE to your desired pre-trained model.
Change DATASET >> NAME to the dataset name that matches the pre-trained model.
Set TEST >> MODEL_PATH to the pre-trained weights of the model being tested.
Change TEST >> FILE to the file or image folder path you want to test.
Test results will be saved in SAVE_DIR.
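
The field changes listed above can be pictured as edits to a nested mapping, where `A >> B` addresses key `B` inside section `A`. Here is a small sketch of that, assuming the config has been loaded into a Python dict (for example with a YAML parser such as PyYAML). `set_field` and `make_inference_config` are hypothetical helpers, the paths and the backbone name are placeholders, and placing BACKBONE under MODEL is an assumption.

```python
import copy


def set_field(cfg, path, value):
    """Set a nested field addressed as 'SECTION >> KEY'."""
    keys = [k.strip() for k in path.split(">>")]
    node = cfg
    for key in keys[:-1]:
        node = node.setdefault(key, {})
    node[keys[-1]] = value


def make_inference_config(base, model, backbone, dataset, weights, source):
    """Return a copy of `base` with the inference-related fields filled in."""
    cfg = copy.deepcopy(base)  # leave the original config untouched
    set_field(cfg, "MODEL >> NAME", model)
    set_field(cfg, "MODEL >> BACKBONE", backbone)   # assumed to live under MODEL
    set_field(cfg, "DATASET >> NAME", dataset)
    set_field(cfg, "TEST >> MODEL_PATH", weights)
    set_field(cfg, "TEST >> FILE", source)
    return cfg


base = {
    "SAVE_DIR": "output",
    "MODEL": {"NAME": "", "BACKBONE": ""},
    "DATASET": {"NAME": ""},
    "TEST": {},
}
cfg = make_inference_config(
    base,
    model="SegFormer",
    backbone="MiT-B2",                          # placeholder backbone name
    dataset="ADE20K",
    weights="checkpoints/example_weights.pth",  # placeholder path
    source="assets/example_images",             # placeholder path
)
```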

## example using ade20k pretrained models
$ python tools/infer.py --cfg configs/ade20k.yaml

Example test results (SegFormer-B2):

test_result

Convert to other frameworks (ONNX, CoreML, OpenVINO, TFLite)

To convert to ONNX and CoreML, run:

$ python tools/export.py --cfg configs/<CONFIG_FILE_NAME>.yaml

To convert to OpenVINO and TFLite, see torch_optimize.

Inference (ONNX, OpenVINO, TFLite)

## ONNX Inference
$ python scripts/onnx_infer.py --model <ONNX_MODEL_PATH> --img-path <TEST_IMAGE_PATH>

## OpenVINO Inference
$ python scripts/openvino_infer.py --model <OpenVINO_MODEL_PATH> --img-path <TEST_IMAGE_PATH>

## TFLite Inference
$ python scripts/tflite_infer.py --model <TFLite_MODEL_PATH> --img-path <TEST_IMAGE_PATH>

References

https://github.com/coincheung/bisenet
https://github.com/open-mmlab/mmsegmentation
https://github.com/rwightman/pytorch-image-models

Citations

@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

@misc{xiao2018unified,
  title={Unified Perceptual Parsing for Scene Understanding}, 
  author={Tete Xiao and Yingcheng Liu and Bolei Zhou and Yuning Jiang and Jian Sun},
  year={2018},
  eprint={1807.10221},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@article{hong2021deep,
  title={Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes},
  author={Hong, Yuanduo and Pan, Huihui and Sun, Weichao and Jia, Yisong},
  journal={arXiv preprint arXiv:2101.06085},
  year={2021}
}

@misc{zhang2021rest,
  title={ResT: An Efficient Transformer for Visual Recognition}, 
  author={Qinglong Zhang and Yubin Yang},
  year={2021},
  eprint={2105.13677},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{huang2021fapn,
  title={FaPN: Feature-aligned Pyramid Network for Dense Image Prediction}, 
  author={Shihua Huang and Zhichao Lu and Ran Cheng and Cheng He},
  year={2021},
  eprint={2108.07058},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{wang2021pvtv2,
  title={PVTv2: Improved Baselines with Pyramid Vision Transformer}, 
  author={Wenhai Wang and Enze Xie and Xiang Li and Deng-Ping Fan and Kaitao Song and Ding Liang and Tong Lu and Ping Luo and Ling Shao},
  year={2021},
  eprint={2106.13797},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@article{Liu2021PSA,
  title={Polarized Self-Attention: Towards High-quality Pixel-wise Regression},
  author={Huajun Liu and Fuqiang Liu and Xinyi Fan and Dong Huang},
  journal={arXiv preprint arXiv:2107.00782},
  year={2021}
}

@misc{chao2019hardnet,
  title={HarDNet: A Low Memory Traffic Network}, 
  author={Ping Chao and Chao-Yang Kao and Yu-Shan Ruan and Chien-Hsiang Huang and Youn-Long Lin},
  year={2019},
  eprint={1909.00948},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@inproceedings{sfnet,
  title={Semantic Flow for Fast and Accurate Scene Parsing},
  author={Li, Xiangtai and You, Ansheng and Zhu, Zhen and Zhao, Houlong and Yang, Maoke and Yang, Kuiyuan and Tong, Yunhai},
  booktitle={ECCV},
  year={2020}
}

@article{Li2020SRNet,
  title={Towards Efficient Scene Understanding via Squeeze Reasoning},
  author={Xiangtai Li and Xia Li and Ansheng You and Li Zhang and Guang-Liang Cheng and Kuiyuan Yang and Y. Tong and Zhouchen Lin},
  journal={ArXiv},
  year={2020},
  volume={abs/2011.03308}
}

@ARTICLE{Yucondnet21,
  author={Yu, Changqian and Shao, Yuanjie and Gao, Changxin and Sang, Nong},
  journal={IEEE Signal Processing Letters}, 
  title={CondNet: Conditional Classifier for Scene Segmentation}, 
  year={2021},
  volume={28},
  number={},
  pages={758-762},
  doi={10.1109/LSP.2021.3070472}
}

@misc{yan2022lawin,
  title={Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention}, 
  author={Haotian Yan and Chuang Zhang and Ming Wu},
  year={2022},
  eprint={2201.01615},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{yu2021metaformer,
  title={MetaFormer is Actually What You Need for Vision}, 
  author={Weihao Yu and Mi Luo and Pan Zhou and Chenyang Si and Yichen Zhou and Xinchao Wang and Jiashi Feng and Shuicheng Yan},
  year={2021},
  eprint={2111.11418},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{wightman2021resnet,
  title={ResNet strikes back: An improved training procedure in timm}, 
  author={Ross Wightman and Hugo Touvron and Hervé Jégou},
  year={2021},
  eprint={2110.00476},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{liu2022convnet,
  title={A ConvNet for the 2020s}, 
  author={Zhuang Liu and Hanzi Mao and Chao-Yuan Wu and Christoph Feichtenhofer and Trevor Darrell and Saining Xie},
  year={2022},
  eprint={2201.03545},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@misc{li2022uniformer,
  title={UniFormer: Unifying Convolution and Self-attention for Visual Recognition}, 
  author={Kunchang Li and Yali Wang and Junhao Zhang and Peng Gao and Guanglu Song and Yu Liu and Hongsheng Li and Yu Qiao},
  year={2022},
  eprint={2201.09450},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

Additional information

Version: v0.2.6
Type: Python
Updated: 2025-07-12
Size: 2.32MB
Source: GitHub