courts.tjmg.client.TJMGScraper
courts.tjmg.client.TJMGScraper(sleep_time=1.0)
Scraper for the Court of Justice of Minas Gerais (TJMG).
The TJMG jurisprudence search is protected by a 5-digit numeric image captcha, which is decoded with [txtcaptcha](https://github.com/jtrecenti/txtcaptcha). The captcha is validated once per session, so pagination reuses the same HTTP session after the first successful decoding.
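
The once-per-session behaviour described above can be pictured with a small sketch. It is not the scraper's actual implementation: `OncePerSessionCaptcha`, `fetch_pages`, and the `solve`, `submit` and `fetch_page` callables are hypothetical names introduced only for illustration, and no network request is made.

```python
from typing import Callable, Iterable, List


class OncePerSessionCaptcha:
    """Hypothetical guard: solve the captcha only until it is accepted once."""

    def __init__(self, solve: Callable[[], str], submit: Callable[[str], bool]):
        self._solve = solve    # assumed: returns the decoded 5-digit string
        self._submit = submit  # assumed: returns True if the site accepts it
        self.validated = False
        self.attempts = 0

    def ensure_validated(self, max_attempts: int = 5) -> None:
        if self.validated:
            return  # session already cleared, later pages skip the captcha
        for _ in range(max_attempts):
            self.attempts += 1
            if self._submit(self._solve()):
                self.validated = True
                return
        raise RuntimeError("captcha was not accepted")


def fetch_pages(
    guard: OncePerSessionCaptcha,
    fetch_page: Callable[[int], str],
    pages: Iterable[int],
) -> List[str]:
    """Fetch every page through the same (already validated) session."""
    html = []
    for page in pages:
        guard.ensure_validated()  # does real work only before the first page
        html.append(fetch_page(page))
    return html
```

In this sketch a wrong decoding simply triggers another attempt, and once one attempt succeeds every later page is fetched without touching the captcha again.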
Methods
| Name | Description |
|---|---|
| cjsg | Convenience method: download + parse. |
| cjsg_download | Run a TJMG acórdão search and return the raw HTML of each page. |
| cjsg_parse | Transform raw TJMG HTML pages into a DataFrame. |
| cpopg | Stub: first-instance case search is not implemented for TJMG. |
| cposg | Stub: second-instance case search is not implemented for TJMG. |
cjsg
courts.tjmg.client.TJMGScraper.cjsg(pesquisa=None, paginas=None, **kwargs)
Convenience method: download + parse.
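
Because `cjsg` is described as download plus parse, the two steps can presumably also be run separately, which keeps the raw HTML available for inspection or caching. The sketch below assumes that the output of `cjsg_download` is accepted as-is by `cjsg_parse` and that extra keyword arguments are the `cjsg_download` options. `cjsg_two_step` is a hypothetical helper, not part of the library, and it works with any object that exposes those two methods.

```python
def cjsg_two_step(scraper, pesquisa, paginas=None, **kwargs):
    """Download first, then parse, returning both the raw pages and the table.

    `scraper` is expected to expose `cjsg_download` and `cjsg_parse`
    with the signatures shown on this page.
    """
    raw_pages = scraper.cjsg_download(pesquisa=pesquisa, paginas=paginas, **kwargs)
    parsed = scraper.cjsg_parse(raw_pages)
    return raw_pages, parsed
```

With a real `TJMGScraper` instance this would be called as `cjsg_two_step(scraper, "dano moral", paginas=range(1, 3))`, where the search term is only an example.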
cjsg_download
courts.tjmg.client.TJMGScraper.cjsg_download(
    pesquisa=None,
    paginas=None,
    pesquisar_por='ementa',
    order_by=2,
    linhas_por_pagina=10,
    **kwargs,
)
Run a TJMG acórdão search and return the raw HTML of each page.
Parameters
| Name | Type | Description |
|---|---|---|
| pesquisa | str | Free-text search term. |
| paginas | int, list, range or None | Pages to download (1-based). None downloads every page (capped at 400 results, the TJMG limit). |
| pesquisar_por | str | Field to search in: "ementa" or "acordao" (inteiro teor, the full text). |
| order_by | int | Sort order: 2 = data julgamento (judgment date), 1 = data publicação (publication date), 0 = precisão (relevance). |
| linhas_por_pagina | int | Results per page (10, 20 or 50). |
| data_julgamento_inicio, data_julgamento_fim | str | Judgment date range (dd/mm/yyyy or yyyy-mm-dd). |
| data_publicacao_inicio, data_publicacao_fim | str | Publication date range (dd/mm/yyyy or yyyy-mm-dd). |
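
Two of the constraints above are easy to check before calling the scraper: the 400-result cap bounds how many pages can exist for a given page size, and dates may arrive in either of two formats. The helpers below are hypothetical (`max_pages` and `to_br_date` are not part of the library) and only illustrate the arithmetic and the format handling. The format the scraper uses internally is not stated here, so dd/mm/yyyy is chosen as the target purely for illustration.

```python
from datetime import datetime

MAX_RESULTS = 400  # result cap stated in the parameter description


def max_pages(linhas_por_pagina: int = 10) -> int:
    """Largest page number reachable under the 400-result cap."""
    if linhas_por_pagina not in (10, 20, 50):
        raise ValueError("linhas_por_pagina must be 10, 20 or 50")
    return MAX_RESULTS // linhas_por_pagina


def to_br_date(value: str) -> str:
    """Accept dd/mm/yyyy or yyyy-mm-dd and return dd/mm/yyyy."""
    for fmt in ("%d/%m/%Y", "%Y-%m-%d"):
        try:
            return datetime.strptime(value, fmt).strftime("%d/%m/%Y")
        except ValueError:
            continue  # try the next accepted format
    raise ValueError(f"unsupported date format: {value!r}")
```

Under these assumptions, 10 results per page gives at most 40 pages and 50 per page gives at most 8, so requesting pages beyond that bound would return nothing new.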
cjsg_parse
courts.tjmg.client.TJMGScraper.cjsg_parse(raw_pages)
Transform raw TJMG HTML pages into a DataFrame.
cpopg
courts.tjmg.client.TJMGScraper.cpopg(id_cnj)
Stub: first-instance case search is not implemented for TJMG.
cposg
courts.tjmg.client.TJMGScraper.cposg(id_cnj)
Stub: second-instance case search is not implemented for TJMG.