ดาวน์โหลด markdrop - ดาวน์โหลดซอร์สโค้ด markdrop

markdrop

โค้ดแหล่งที่มา AI

1.0.0

ดาวน์โหลด

มาร์คโดรป

แพ็คเกจ Python สำหรับการแปลง PDFs (หรือ URL PDF) เป็น markdown ในขณะที่แยกรูปภาพและตาราง Markdrop ทำให้ง่ายต่อการแปลงเอกสาร PDF เป็นรูปแบบ Markdown ในขณะที่รักษาภาพและตาราง

คุณสมบัติ

การแปลง PDF เป็น markdown พร้อมการเก็บรักษาการจัดรูปแบบโดยใช้ docling
การสกัดภาพอัตโนมัติพร้อมการเก็บรักษาคุณภาพโดยใช้ XREF ID
การตรวจจับตารางโดยใช้หม้อแปลงตารางของ Microsoft
การสนับสนุน URL PDF สำหรับฟังก์ชันสามฟังก์ชั่นข้างต้น
คำอธิบายคำอธิบายเชิงข้อความสำหรับไฟล์ภาพหรือโฟลเดอร์ใด ๆ
การจดจำอักขระออพติคอล (OCR) สำหรับรูปภาพที่มีข้อความฝังตัว
การสนับสนุนที่เพิ่มขึ้นสำหรับรูปแบบเอาต์พุตที่มีโครงสร้าง (เช่น JSON, YAML)
รองรับ PDF หลายภาษา

การติดตั้ง

pip install markdrop

https://pypi.org/project/markdrop

เริ่มต้นอย่างรวดเร็ว

 from markdrop import extract_images , make_markdown , extract_tables_from_pdf

source_pdf = 'url/or/path/to/pdf/file'    # Replace with your local PDF file path or a URL
output_dir = 'data/output'                # Replace it with desired output directory's path

make_markdown ( source_pdf , output_dir )
extract_images ( source_pdf , output_dir , verbose = True )
extract_tables_from_pdf ( source_pdf , output_dir = output_dir )

 from markdrop import setup_keys

### API Key Setup
### If using 'openai' or 'gemini' as llm_client in the generate_descriptions function, you need to set up the API keys first.

setup_keys ()

 from markdrop import generate_descriptions

### Image Descriptions Generation

prompt = "Give textual highly detailed descriptions from this image ONLY, nothing else." # Replace it with your desired prompt
input_path = 'path/to/img_file/or/dir'    # Replace it with the path to the images dir or image file
output_dir = 'data/output'                # Replace it with the desired output directory's path
llm_clients = [ 'gemini' , 'llama-vision' ]        # Replace it with the desired models from ['qwen', 'gemini', 'openai', 'llama-vision', 'molmo', 'pixtral'] only

generate_descriptions ( input_path = input_path , output_dir = output_dir , prompt = prompt , llm_client = llm_clients )

การอ้างอิง API

Make_markdown (แหล่งที่มา, output_dir, verbose = false)

แปลง PDF หรือ URL เป็นรูปแบบ markdown

พารามิเตอร์:

source (STR): เส้นทางไปยัง PDF หรือ URL
output_dir (STR): เส้นทางไดเรกทอรีเอาต์พุต
verbose (บูล): เปิดใช้งานการบันทึกรายละเอียด

extract_images (แหล่งที่มา, output_dir, verbose = false)

แยกภาพจาก PDF หรือ URL ในขณะที่รักษาคุณภาพ

พารามิเตอร์:

source (STR): เส้นทางไปยัง PDF หรือ URL
output_dir (STR): เส้นทางไดเรกทอรีเอาต์พุต
verbose (บูล): เปิดใช้งานการบันทึกรายละเอียด

extract_tables_from_pdf (pdf_path, ** kwargs)

ตรวจจับและแยกภาพตาราง

พารามิเตอร์:

pdf_path (str): เส้นทางไปยังอินพุต pdf หรือ url
start_page (int, ไม่บังคับ): หมายเลขหน้าเริ่มต้น
end_page (int, ไม่บังคับ): หมายเลขหน้าสิ้นสุด
threshold (Float, เสริม): Threshold ความเชื่อมั่นในการตรวจจับ
output_dir (STR): เส้นทางไดเรกทอรีเอาต์พุต

generate_descriptions (input_path, output_dir, พรอมต์, llm_client)

สร้างคำอธิบายของภาพตามพรอมต์ที่กำหนดและ llm_client ใน CSV

llm clients ได้รับการสนับสนุนคือ ['Qwen', 'Gemini', 'Openai', 'Llama-Vision', 'Molmo', 'Pixtral']

พารามิเตอร์:

input_path (str): เส้นทางไปยังอินพุต pdf หรือ url
output_dir (STR): เส้นทางไดเรกทอรีเอาต์พุต
prompt (str): แจ้งให้ส่งไปยังแบบจำลองพร้อมกับรูปภาพ
llm_client (รายการ): รายการที่มีรุ่นขั้นต่ำหนึ่งรุ่นจากไคลเอนต์ LLM

วิเคราะห์ _pdf_images (แหล่งที่มา, output_dir, verbose = false):

วิเคราะห์การอ้างอิงรูปภาพประเภทต่างๆใน PDF จากไฟล์ท้องถิ่นหรือ URL

พารามิเตอร์:

source (STR): เส้นทาง PDF ท้องถิ่นหรือ URL ไปยัง PDF
output_dir (str): ไดเรกทอรีสำหรับไฟล์ชั่วคราว
verbose (บูล): พิมพ์ข้อมูลรายละเอียด

การบริจาค

เรายินดีต้อนรับผลงาน! โปรดดูแนวทางการสนับสนุนของเราสำหรับรายละเอียด

การตั้งค่าการพัฒนา

โคลนที่เก็บ:

git clone https://github.com/shoryasethia/markdrop.git  
cd markdrop

สร้างสภาพแวดล้อมเสมือนจริง:

python -m venv venv  
source venv/bin/activate  # On Windows: venvScriptsactivate

ติดตั้งการพัฒนาการพัฒนา:

pip install -r requirements.txt

โครงสร้างโครงการ

markdrop/  
├── LICENSE  
├── README.md  
├── CONTRIBUTING.md  
├── CHANGELOG.md  
├── requirements.txt  
├── setup.py  
└── markdrop/ 
    ├── models/
    |   ├── .env
    |   ├── img_descriptions.py
    |   ├── logger.py
    |   ├── model_loader.py
    |   ├── responder.py
    |   └── setup_keys.py
    ├── __init__.py  
    ├── main.py  
    ├── utils.py  
    ├── helper.py
    └── ignore_warnings.py