Autodistill GPT-4V Module

This repository contains the code supporting the GPT-4V base model for use with Autodistill.

GPT-4V, developed by OpenAI, is a multimodal language model. With GPT-4V, you can ask questions about images in natural language. The autodistill-gpt4v module lets you use GPT-4V to classify images.

This model uses the gpt-4-vision-preview API announced by OpenAI on November 6, 2023.

Note

Using this project incurs billing charges for calls to the OpenAI GPT-4 Vision API. Refer to the OpenAI pricing page for details and to calculate your expected cost. This package makes one API call per image you want to label.
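Because the package makes one API call per image, a rough cost estimate is the image count multiplied by the price of one call. The sketch below is illustrative only: `count_images` and `estimate_cost` are hypothetical helpers, not part of Autodistill, and the per-call price is a placeholder you would replace with a figure from the OpenAI pricing page. It also assumes images are selected by a simple file-extension match inside one folder.

```python
from pathlib import Path


def count_images(folder: str, extension: str = ".jpeg") -> int:
    """Count regular files in `folder` whose names end with `extension`."""
    root = Path(folder)
    if not root.is_dir():
        return 0
    return sum(1 for path in root.glob(f"*{extension}") if path.is_file())


def estimate_cost(num_images: int, price_per_call_usd: float) -> float:
    """One API call per image, so cost grows linearly with the image count."""
    return num_images * price_per_call_usd


if __name__ == "__main__":
    n = count_images("./context_images")
    # 0.01 is a placeholder value, NOT an actual OpenAI price.
    print(f"{n} images, estimated cost: ${estimate_cost(n, 0.01):.2f}")
```

Actual per-call cost can vary with image size and response length, so treat the result as a lower-effort approximation rather than a bill.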

Read the full Autodistill documentation.

Read the GPT-4V Autodistill documentation.

Installation

To use GPT-4V with Autodistill, you need to install the following dependency:

pip3 install autodistill-gpt-4v

QuickStart

from autodistill.detection import CaptionOntology
from autodistill_gpt_4v import GPT4V

# define an ontology to map class names to our GPT-4V prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = GPT4V(
    ontology=CaptionOntology(
        {
            "person": "person",
            "a forklift": "forklift"
        }
    ),
    api_key="OPENAI_API_KEY"  # replace with your OpenAI API key
)

# label every .jpeg image in the folder (one API call per image)
base_model.label("./context_images", extension=".jpeg")
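The quickstart passes the API key as a string literal. One common alternative is to read the key from an environment variable so that it never lands in source control. The following is a minimal sketch under that assumption: `load_openai_api_key` is a hypothetical helper, not part of Autodistill, and the variable name `OPENAI_API_KEY` is only a convention.

```python
import os


def load_openai_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Return the API key stored in `env_var`, or raise if it is unset or blank."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable before labeling.")
    return key


# Usage with the quickstart (requires autodistill and autodistill-gpt-4v):
# base_model = GPT4V(ontology=..., api_key=load_openai_api_key())
```

Failing early with a clear message avoids starting a labeling run that would otherwise stop at the first API call.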