markdrop

1.0.0

Markdrop

A Python package for converting PDFs (or PDF URLs) to Markdown while extracting images and tables. Markdrop makes it easy to convert PDF documents into Markdown format while preserving images and tables.

Features

PDF to Markdown conversion with formatting preservation using Docling
Automatic image extraction with quality preservation using XRef Id
Table detection using Microsoft's Table Transformer
PDF URL support for the three core functions
Textual descriptions for any image file or folder
Optical Character Recognition (OCR) for images with embedded text
Enhanced support for structured output formats (e.g., JSON, YAML)
Multi-language PDF support

Installation

pip install markdrop

https://pypi.org/project/markdrop

Quick Start

from markdrop import extract_images, make_markdown, extract_tables_from_pdf

source_pdf = 'url/or/path/to/pdf/file'    # Replace with your local PDF file path or a URL
output_dir = 'data/output'                # Replace with the desired output directory path

make_markdown(source_pdf, output_dir)
extract_images(source_pdf, output_dir, verbose=True)
extract_tables_from_pdf(source_pdf, output_dir=output_dir)

from markdrop import setup_keys

# API Key Setup
# If using 'openai' or 'gemini' as llm_client in the generate_descriptions function, you need to set up the API keys first.

setup_keys()

from markdrop import generate_descriptions

# Image Descriptions Generation

prompt = "Give textual highly detailed descriptions from this image ONLY, nothing else."  # Replace with your desired prompt
input_path = 'path/to/img_file/or/dir'    # Replace with the path to an image file or a directory of images
output_dir = 'data/output'                # Replace with the desired output directory path
llm_clients = ['gemini', 'llama-vision']  # Choose only from ['qwen', 'gemini', 'openai', 'llama-vision', 'molmo', 'pixtral']

generate_descriptions(input_path=input_path, output_dir=output_dir, prompt=prompt, llm_client=llm_clients)

API Reference

make_markdown(source, output_dir, verbose=False)

Converts a PDF, given as a local path or a URL, to Markdown format.

Parameters:

source (str): Path to the input PDF, or its URL
output_dir (str): Path to the output directory
verbose (bool): Enable detailed logging
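
Because source accepts either a local path or a URL, it can help to check the input before starting a conversion. The following is a minimal sketch, assuming that URLs start with http:// or https:// (how markdrop itself tells the two apart is not documented here). The helpers is_pdf_url, check_source, and convert are hypothetical names for illustration and are not part of markdrop.

```python
from pathlib import Path
from urllib.parse import urlparse


def is_pdf_url(source: str) -> bool:
    """Return True if source looks like an http(s) URL (assumed convention)."""
    parsed = urlparse(source)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def check_source(source: str) -> str:
    """Return source unchanged if it is a URL or an existing local file."""
    if is_pdf_url(source) or Path(source).is_file():
        return source
    raise FileNotFoundError(f"No such local PDF: {source}")


def convert(source: str, output_dir: str, verbose: bool = False) -> None:
    """Validate the source, create output_dir, then call make_markdown."""
    # Imported lazily so the helpers above work without markdrop installed.
    from markdrop import make_markdown

    Path(output_dir).mkdir(parents=True, exist_ok=True)
    make_markdown(check_source(source), output_dir, verbose=verbose)
```

Failing early on a mistyped local path avoids waiting for the conversion to start before the error surfaces.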

extract_images(source, output_dir, verbose=False)

Extracts images from a PDF, given as a local path or a URL, while preserving their quality.

Parameters:

source (str): Path to the input PDF, or its URL
output_dir (str): Path to the output directory
verbose (bool): Enable detailed logging
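
When extracting images from several PDFs, giving each document its own output directory keeps the results apart. This is a sketch only: whether markdrop already separates output per document is not stated here, and output_dir_for and extract_all are hypothetical helper names.

```python
from pathlib import Path, PurePosixPath
from urllib.parse import urlparse


def output_dir_for(source: str, base_dir: str) -> Path:
    """Map a PDF path or URL to base_dir/<file stem>."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        # For URLs, use only the path component (drops query strings).
        name = PurePosixPath(parsed.path).name
    else:
        name = Path(source).name
    stem = Path(name).stem or "document"  # fallback when no file name is present
    return Path(base_dir) / stem


def extract_all(sources, base_dir: str = "data/output", verbose: bool = False) -> None:
    """Run extract_images once per source, each into its own directory."""
    # Imported lazily so output_dir_for works without markdrop installed.
    from markdrop import extract_images

    for source in sources:
        target = output_dir_for(source, base_dir)
        target.mkdir(parents=True, exist_ok=True)
        extract_images(source, str(target), verbose=verbose)
```

Two sources with the same file name would still share a directory, so a real batch job may need a stronger naming scheme.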

extract_tables_from_pdf(pdf_path, **kwargs)

Detects tables and extracts them as images.

Parameters:

pdf_path (str): Path to the input PDF, or its URL
start_page (int, optional): Page number to start from
end_page (int, optional): Page number to end at
threshold (float, optional): Detection confidence threshold
output_dir (str): Path to the output directory
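
Since the optional settings are passed through **kwargs, one way to call the function is to build the keyword dictionary first and leave out anything that was not set, so the library defaults apply. The sketch below assumes the threshold is a probability between 0 and 1, which is typical for detection confidence but not confirmed here. table_kwargs and extract_tables are hypothetical helper names.

```python
from typing import Any, Dict, Optional


def table_kwargs(output_dir: str,
                 start_page: Optional[int] = None,
                 end_page: Optional[int] = None,
                 threshold: Optional[float] = None) -> Dict[str, Any]:
    """Collect only the options that were set, after basic sanity checks."""
    if start_page is not None and end_page is not None and start_page > end_page:
        raise ValueError("start_page must not exceed end_page")
    if threshold is not None and not 0.0 <= threshold <= 1.0:
        # Assumption: the confidence threshold lies in [0, 1].
        raise ValueError("threshold is assumed to lie in [0, 1]")
    options = {
        "output_dir": output_dir,
        "start_page": start_page,
        "end_page": end_page,
        "threshold": threshold,
    }
    # Drop unset options so markdrop's own defaults are used for them.
    return {key: value for key, value in options.items() if value is not None}


def extract_tables(pdf_path: str, **options: Any) -> None:
    """Validate the options, then forward them to extract_tables_from_pdf."""
    # Imported lazily so table_kwargs works without markdrop installed.
    from markdrop import extract_tables_from_pdf

    extract_tables_from_pdf(pdf_path, **table_kwargs(**options))
```

For example, extract_tables('path/to/file.pdf', output_dir='data/output', threshold=0.8) forwards only output_dir and threshold.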

generate_descriptions(input_path, output_dir, prompt, llm_client)

Generates descriptions of the image(s) based on the given prompt and llm_client, and writes them to a CSV file.

Supported llm clients are ['qwen', 'gemini', 'openai', 'llama-vision', 'molmo', 'pixtral'].

Parameters:

input_path (str): Path to an image file or a directory of images
output_dir (str): Path to the output directory
prompt (str): Prompt sent to the model along with the image
llm_client (list): List containing at least one of the supported LLM clients
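
A typo in a client name is easy to make, so checking the list before any model is loaded can save time. This sketch validates against the supported names listed above and flags the two clients that need API keys according to the Quick Start. validate_clients, needs_api_keys, and describe are hypothetical helper names, not part of markdrop.

```python
from typing import Iterable, List, Union

SUPPORTED_LLM_CLIENTS = ("qwen", "gemini", "openai", "llama-vision", "molmo", "pixtral")
NEEDS_API_KEY = {"openai", "gemini"}  # per the Quick Start note on setup_keys()


def validate_clients(llm_clients: Union[str, Iterable[str]]) -> List[str]:
    """Return the clients as a list, rejecting empty or unknown entries."""
    # Accept a single name as a convenience instead of splitting it into characters.
    clients = [llm_clients] if isinstance(llm_clients, str) else list(llm_clients)
    if not clients:
        raise ValueError("llm_client must contain at least one client")
    unknown = [name for name in clients if name not in SUPPORTED_LLM_CLIENTS]
    if unknown:
        raise ValueError(f"Unsupported llm clients: {unknown}")
    return clients


def needs_api_keys(clients: Iterable[str]) -> bool:
    """True if any selected client requires keys set up via setup_keys()."""
    return any(name in NEEDS_API_KEY for name in clients)


def describe(input_path: str, output_dir: str, prompt: str, llm_clients) -> None:
    """Validate the client list, then call generate_descriptions."""
    # Imported lazily so the checks above work without markdrop installed.
    from markdrop import generate_descriptions

    clients = validate_clients(llm_clients)
    if needs_api_keys(clients):
        print("Reminder: run setup_keys() first if the API keys are not configured yet.")
    generate_descriptions(input_path=input_path, output_dir=output_dir,
                          prompt=prompt, llm_client=clients)
```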

analyze_pdf_images(source, output_dir, verbose=False)

Analyzes the different types of image references in a PDF from a local file or a URL.

Parameters:

source (str): Path to a local PDF, or a URL to a PDF
output_dir (str): Directory for temporary files
verbose (bool): Print detailed information
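
Since output_dir here only holds temporary files, one option is to hand the function a throwaway directory that is deleted afterwards. The wrapper below is a sketch: run_with_temp_dir is a hypothetical helper, and it takes the analysis function as an argument so it makes no assumption about what that function returns.

```python
import tempfile
from typing import Any, Callable


def run_with_temp_dir(analyze: Callable[..., Any], source: str, verbose: bool = False) -> Any:
    """Call analyze(source, tmp_dir, verbose=...) with a directory removed afterwards."""
    with tempfile.TemporaryDirectory(prefix="markdrop_") as tmp_dir:
        # The directory and everything in it is deleted when this block exits.
        return analyze(source, tmp_dir, verbose=verbose)
```

Assuming analyze_pdf_images can be imported from the package top level like the other functions, the call would be run_with_temp_dir(analyze_pdf_images, 'path/to/file.pdf'). If the result refers to files inside the temporary directory, those files no longer exist after the call returns, so use a persistent output_dir in that case.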

Contributing

Contributions are welcome! Please see our contributing guidelines for details.

Development Setup

Clone the repository:

git clone https://github.com/shoryasethia/markdrop.git  
cd markdrop

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install the development dependencies:

pip install -r requirements.txt

Project Structure

markdrop/  
├── LICENSE  
├── README.md  
├── CONTRIBUTING.md  
├── CHANGELOG.md  
├── requirements.txt  
├── setup.py  
└── markdrop/ 
    ├── models/
    │   ├── .env
    │   ├── img_descriptions.py
    │   ├── logger.py
    │   ├── model_loader.py
    │   ├── responder.py
    │   └── setup_keys.py
    ├── __init__.py  
    ├── main.py  
    ├── utils.py  
    ├── helper.py
    └── ignore_warnings.py