aggregators.jusbr.client.JusbrScraper

aggregators.jusbr.client.JusbrScraper(
    verbose=0,
    download_path=None,
    sleep_time=0.5,
    token=None,
)

Raspador para o JusBR (consulta unificada da PDPJ-CNJ). Este scraper interage com a API da Plataforma Digital do Poder Judiciario (PDPJ).

Methods

Name Description
auth Define o token JWT para autenticacao e o decodifica para verificacao.
auth_firefox Authentication via Firefox.
cpopg Consulta processos pelo numero CNJ (ou lista de numeros CNJ) via API nacional.
download_documents Downloads document texts for processes in base_df.

auth

aggregators.jusbr.client.JusbrScraper.auth(token)

Define o token JWT para autenticacao e o decodifica para verificacao.

auth_firefox

aggregators.jusbr.client.JusbrScraper.auth_firefox()

Authentication via Firefox.

cpopg

aggregators.jusbr.client.JusbrScraper.cpopg(id_cnj)

Consulta processos pelo numero CNJ (ou lista de numeros CNJ) via API nacional.

download_documents

aggregators.jusbr.client.JusbrScraper.download_documents(
    base_df,
    max_docs_per_process=None,
)

Downloads document texts for processes in base_df. Iterates through processes in base_df, extracts document metadata from the ‘detalhes’ column, fetches, and cleans document texts. Returns a DataFrame where each row is a document.