llm elasticsearch cache Download - llm elasticsearch cache do download do código fonte

llm elasticsearch cache

Código-Fonte de IA

1.0.0

Baixar

Importante

Esta biblioteca agora faz parte de Langchain, siga a documentação oficial, por exemplo, o cache LLM

LLM-ELASTICSearch-Cache

Uma camada de armazenamento em cache para LLMs que exploram o Elasticsearch, totalmente compatíveis com o cache de Langchain, tanto para os modelos de bate -papo quanto de incorporação.

Instalar

pip install llm-elasticsearch-cache

Uso do cache de bate -papo

O cache Langchain pode ser usado de maneira semelhante às outras integrações do cache.

Exemplo básico

 from langchain . globals import set_llm_cache
from llmescache . langchain import ElasticsearchCache
from elasticsearch import Elasticsearch

es_client = Elasticsearch ( hosts = "http://localhost:9200" )
set_llm_cache (
    ElasticsearchCache (
        es_client = es_client , 
        es_index = "llm-chat-cache" , 
        metadata = { "project" : "my_chatgpt_project" }
    )
)

O parâmetro es_index também pode levar aliases. Isso permite usar o ILM: gerencie o ciclo de vida do índice que sugerimos considerar para gerenciar a retenção e controlar o crescimento do cache.

Veja a classe Docstring para todos os parâmetros.

Indexar o texto gerado

Os dados em cache não serão pesquisáveis por padrão. O desenvolvedor pode personalizar a construção do documento Elasticsearch para adicionar campos de texto indexados, onde colocar, por exemplo, o texto gerado pelo LLM.

Isso pode ser feito subclassificando métodos de substituição final. A nova classe de cache pode ser aplicada também a um índice de cache pré-existente:

 from llmescache . langchain import ElasticsearchCache
from elasticsearch import Elasticsearch
from langchain_core . caches import RETURN_VAL_TYPE
from typing import Any , Dict , List
from langchain . globals import set_llm_cache
import json


class SearchableElasticsearchCache ( ElasticsearchCache ):

    @ property
    def mapping ( self ) -> Dict [ str , Any ]:
        mapping = super (). mapping
        mapping [ "mappings" ][ "properties" ][ "parsed_llm_output" ] = { "type" : "text" , "analyzer" : "english" }
        return mapping
    
    def build_document ( self , prompt : str , llm_string : str , return_val : RETURN_VAL_TYPE ) -> Dict [ str , Any ]:
        body = super (). build_document ( prompt , llm_string , return_val )
        body [ "parsed_llm_output" ] = self . _parse_output ( body [ "llm_output" ])
        return body

    @ staticmethod
    def _parse_output ( data : List [ str ]) -> List [ str ]:
        return [ json . loads ( output )[ "kwargs" ][ "message" ][ "kwargs" ][ "content" ] for output in data ]


es_client = Elasticsearch ( hosts = "http://localhost:9200" )
set_llm_cache ( SearchableElasticsearchCache ( es_client = es_client , es_index = "llm-chat-cache" ))

Uso do cache de incorporação

O cache de incorporação é obtido usando o Cachebackedembeddings, de uma maneira ligeiramente diferente da documentação oficial.

 from llmescache . langchain import ElasticsearchStore
from elasticsearch import Elasticsearch
from langchain . embeddings import CacheBackedEmbeddings
from langchain_openai import OpenAIEmbeddings

es_client = Elasticsearch ( hosts = "http://localhost:9200" )

underlying_embeddings = OpenAIEmbeddings ( model = "text-embedding-3-small" )
store = ElasticsearchStore (
    es_client = es_client , 
    es_index = "llm-embeddings-cache" ,
    namespace = underlying_embeddings . model ,
    metadata = { "project" : "my_llm_project" }
)
cached_embeddings = CacheBackedEmbeddings (
    underlying_embeddings , 
    store
)

Da mesma forma que o cache de bate -papo, pode -se subclasse ElasticsearchStore para indexar vetores para pesquisa.

 from llmescache . langchain import ElasticsearchStore
from typing import Any , Dict , List

class SearchableElasticsearchStore ( ElasticsearchStore ):

    @ property
    def mapping ( self ) -> Dict [ str , Any ]:
        mapping = super (). mapping
        mapping [ "mappings" ][ "properties" ][ "vector" ] = { "type" : "dense_vector" , "dims" : 1536 , "index" : True , "similarity" : "dot_product" }
        return mapping
    
    def build_document ( self , llm_input : str , vector : List [ float ]) -> Dict [ str , Any ]:
        body = super (). build_document ( llm_input , vector )
        body [ "vector" ] = vector
        return body

Esteja ciente de que atualmente não suporta consultas de cache, isso significa que as consultas de texto, para pesquisas CacheBackedEmbeddings vetores, não serão armazenadas em cache. No entanto, ao substituir o método embed_query , é possível implementá -lo facilmente.

Expandir

Informações adicionais

Versão 1.0.0
Tipo Código-Fonte de IA
Data da Última Atualização 2025-07-01
tamanho 64.51KB
Vindo de Github

Aplicativos Relacionados

TensorRT LLM

2024-11-10
GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
elasticsearch

2024-11-01

Recomendado para você

chat.petals.dev

Outro código-fonte

1.0.0
GPT Prompt Templates

Outro código-fonte

1.0.0
GPTyped

Outro código-fonte

GPTyped 1.0.5
ML stack

Código-Fonte de IA

1.0.0
awesome free chatgpt

Código-Fonte de IA

1.0.0
promptl

Código-Fonte de IA

1.0.0
Google Dorks

Outro código-fonte

1.0
shepherd

Outro código-fonte

v6.1.6-react-shepherd: Prepare Release (#3063)
hidusbf

Outro código-fonte

1.0.0

Informações Relacionadas Todos