courts.tjsp.client.TJSPScraper

courts.tjsp.client.TJSPScraper(
    verbose=0,
    download_path=None,
    sleep_time=0.5,
    **kwargs,
)

Main scraper for the Tribunal de Justiça de São Paulo (TJSP).

Methods

Name	Description
cjpg	Orchestrates the download and parsing of processes from CJPG.
cjpg_download	Downloads the processes from the TJSP jurisprudence.
cjpg_parse	Wrapper for parsing downloaded files from CJPG.
cjsg	Orchestrates the download and parsing of processes from CJSG.
cjsg_download	Downloads the HTML files of the pages of results of the Second Stage Judgment Consultation (CJSG).
cjsg_parse	Wrapper for parsing downloaded files from CJSG.
cpopg	Scrapes a process from Primeiro Grau (CPOPG).
cpopg_download	Downloads a process from Primeiro Grau (CPOPG).
cpopg_parse	Wrapper for parsing downloaded files from CPOPG.
cposg	Orchestrates the download and parsing of processes from Segundo Grau (CPOSG).
cposg_download	Downloads processes from Segundo Grau (CPOSG), via HTML or API, using modularized functions.
cposg_parse	Wrapper for parsing downloaded files from CPOSG.
set_download_path	Sets the base directory for saving downloaded files.
set_method	Sets the method for accessing TJSP data.

cjpg

courts.tjsp.client.TJSPScraper.cjpg(
    pesquisa='',
    classes=None,
    assuntos=None,
    varas=None,
    id_processo=None,
    data_inicio=None,
    data_fim=None,
    paginas=None,
)

Orchestrates the download and parsing of processes from CJPG.

Parameters

Name	Type	Description	Default
pesquisa	str	The search term.	`''`
classes	list[str]	List of classes of the process.	`None`
assuntos	list[str]	List of subjects of the process.	`None`
varas	list[str]	List of varas of the process.	`None`
id_processo	str	ID of the process.	`None`
data_inicio	str	Start date of the search.	`None`
data_fim	str	End date of the search.	`None`
paginas	range	Range of pages to download.	`None`

cjpg_download

courts.tjsp.client.TJSPScraper.cjpg_download(
    pesquisa,
    classes=None,
    assuntos=None,
    varas=None,
    id_processo=None,
    data_inicio=None,
    data_fim=None,
    paginas=None,
)

Downloads the processes from the TJSP jurisprudence.

Parameters

Name	Type	Description	Default
pesquisa	str	The search term.	required
classes	list[str]	List of classes of the process.	`None`
assuntos	list[str]	List of subjects of the process.	`None`
varas	list[str]	List of varas of the process.	`None`
id_processo	str	ID of the process.	`None`
data_inicio	str	Start date of the search.	`None`
data_fim	str	End date of the search.	`None`
paginas	range	Range of pages to download.	`None`

cjpg_parse

courts.tjsp.client.TJSPScraper.cjpg_parse(path)

Wrapper for parsing downloaded files from CJPG.
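
The three CJPG methods suggest a download-then-parse pipeline: `cjpg_download` saves files, `cjpg_parse(path)` reads them, and `cjpg` chains the two. The sketch below illustrates that pattern only. `download_stub`, `parse_stub` and `orchestrate` are hypothetical stand-ins, not the library's functions, and the assumption that the download step returns a directory path is not confirmed by this page.

```python
from __future__ import annotations

import tempfile
from pathlib import Path
from typing import Optional


def download_stub(pesquisa: str, paginas: Optional[range] = None) -> Path:
    """Hypothetical stand-in for a download step: writes one file per page."""
    out_dir = Path(tempfile.mkdtemp())
    pages = paginas if paginas is not None else range(1)
    for page in pages:
        target = out_dir / f"page_{page}.html"
        target.write_text(f"<p>{pesquisa}</p>", encoding="utf-8")
    # Assumption: the download step hands back the directory it wrote to.
    return out_dir


def parse_stub(path: Path) -> list[dict]:
    """Hypothetical stand-in for a parse step: reads every saved file."""
    return [
        {"file": f.name, "content": f.read_text(encoding="utf-8")}
        for f in sorted(path.glob("*.html"))
    ]


def orchestrate(pesquisa: str, paginas: Optional[range] = None) -> list[dict]:
    """Download first, then parse whatever was downloaded."""
    return parse_stub(download_stub(pesquisa, paginas))


rows = orchestrate("dano moral", paginas=range(0, 2))
print(len(rows))
```

Keeping the two steps separate means already-downloaded files can be re-parsed without contacting the court's site again.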

cjsg

courts.tjsp.client.TJSPScraper.cjsg(
    pesquisa,
    ementa=None,
    classe=None,
    assunto=None,
    comarca=None,
    orgao_julgador=None,
    data_inicio=None,
    data_fim=None,
    baixar_sg=True,
    tipo_decisao='acordao',
    paginas=None,
)

Orchestrates the download and parsing of processes from CJSG.

cjsg_download

courts.tjsp.client.TJSPScraper.cjsg_download(
    pesquisa,
    ementa=None,
    classe=None,
    assunto=None,
    comarca=None,
    orgao_julgador=None,
    data_inicio=None,
    data_fim=None,
    baixar_sg=True,
    tipo_decisao='acordao',
    paginas=None,
)

Downloads the HTML files of the pages of results of the Second Stage Judgment Consultation (CJSG).

Parameters

Name	Type	Description	Default
pesquisa	str	Search term.	required
ementa	str	Filter by text of the ementa.	`None`
classe	str | None	Class of the process.	`None`
assunto	str | None	Subject of the process.	`None`
comarca	str | None	Judicial district (comarca) of the process.	`None`
orgao_julgador	str | None	Judging body (órgão julgador) of the process.	`None`
data_inicio	str | None	Start date of the process.	`None`
data_fim	str | None	End date of the process.	`None`
baixar_sg	bool	If True, also downloads from Second Stage.	`True`
tipo_decisao	str	Decision type: `'acordao'` or `'monocratica'`.	`'acordao'`
paginas	range	Range of pages to download.	`None`

NOTE: range(0, n) downloads pages 1 to n (inclusive), so the count matches what the user sees on the site. For example, range(0, 3) downloads pages 1, 2 and 3.
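
The mapping described in the note can be sketched in plain Python. `pages_from_range` is a hypothetical helper written for illustration, not part of the library, and it only covers the documented `range(0, n)` case. How ranges that start at another value are handled is not stated here.

```python
def pages_from_range(paginas: range) -> list:
    """Map the zero-based values of range(0, n) to page numbers 1..n.

    Hypothetical helper: it mirrors the note above, not the library's code.
    """
    return [index + 1 for index in paginas]


print(pages_from_range(range(0, 3)))  # [1, 2, 3]
```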

cjsg_parse

courts.tjsp.client.TJSPScraper.cjsg_parse(path)

Wrapper for parsing downloaded files from CJSG.

cpopg

courts.tjsp.client.TJSPScraper.cpopg(id_cnj, method='html')

Scrapes a process from Primeiro Grau (CPOPG).

cpopg_download

courts.tjsp.client.TJSPScraper.cpopg_download(id_cnj, method='html')

Downloads a process from Primeiro Grau (CPOPG).

Parameters

Name	Type	Description	Default
id_cnj	Union[str, List[str]]	CNJ number of the process as a string, or a list of CNJ numbers.	required
method	Literal['html', 'api']	Access method. Supported values are `'html'` and `'api'`.	`'html'`

Raises

Name	Type	Description
	Exception	If the method passed as parameter is not ‘html’ or ‘api’.
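
Because `id_cnj` accepts either one string or a list of strings, a caller-side helper can normalize both shapes before looping. This is a minimal sketch under that assumption. `normalize_id_cnj` is hypothetical, and the CNJ number shown is a zero-filled placeholder, not a real process.

```python
from typing import List, Union


def normalize_id_cnj(id_cnj: Union[str, List[str]]) -> List[str]:
    """Return a list of CNJ numbers whether one or many were given.

    Hypothetical helper; the library may handle this differently.
    """
    if isinstance(id_cnj, str):
        return [id_cnj]
    return list(id_cnj)


# Placeholder value in the CNJ layout, not a real process number.
PLACEHOLDER = "0000000-00.0000.8.26.0000"

print(normalize_id_cnj(PLACEHOLDER))
print(normalize_id_cnj([PLACEHOLDER, PLACEHOLDER]))
```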

cpopg_parse

courts.tjsp.client.TJSPScraper.cpopg_parse(path)

Wrapper for parsing downloaded files from CPOPG.

cposg

courts.tjsp.client.TJSPScraper.cposg(id_cnj, method='html')

Orchestrates the download and parsing of processes from Segundo Grau (CPOSG).

cposg_download

courts.tjsp.client.TJSPScraper.cposg_download(id_cnj, method='html')

Downloads processes from Segundo Grau (CPOSG), via HTML or API, using modularized functions.

cposg_parse

courts.tjsp.client.TJSPScraper.cposg_parse(path)

Wrapper for parsing downloaded files from CPOSG.

set_download_path

courts.tjsp.client.TJSPScraper.set_download_path(path=None)

Sets the base directory for saving downloaded files.

Parameters

Name	Type	Description	Default
path	str	Directory in which to save downloaded files. If None, a temporary directory is used.	`None`
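
The fallback to a temporary directory can be pictured with the standard library. The sketch below is an assumption about the general pattern, not the library's implementation. `resolve_download_path` is a hypothetical name, and creating a missing directory is a choice made here for illustration.

```python
import tempfile
from pathlib import Path
from typing import Optional


def resolve_download_path(path: Optional[str] = None) -> Path:
    """Pick the directory where downloads would be saved (hypothetical)."""
    if path is None:
        # No path given: fall back to a fresh temporary directory.
        return Path(tempfile.mkdtemp())
    target = Path(path)
    # Assumption: a missing directory is created rather than rejected.
    target.mkdir(parents=True, exist_ok=True)
    return target


default_dir = resolve_download_path()
custom_dir = resolve_download_path(str(default_dir / "tjsp"))
print(default_dir.is_dir(), custom_dir.name)
```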

set_method

courts.tjsp.client.TJSPScraper.set_method(method)

Sets the method for accessing TJSP data.

Parameters

Name	Type	Description	Default
method	Literal['html', 'api']	Access method. Supported values are `'html'` and `'api'`.	required

Raises

Name	Type	Description
	Exception	If the method is not ‘html’ or ‘api’.
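
The validation described under Raises can be sketched as follows. `MethodHolder` is a hypothetical stand-in class, not `TJSPScraper` itself. The initial value of `'html'` and the error message text are assumptions. Only the accepted values and the `Exception` type come from the tables above.

```python
SUPPORTED_METHODS = ("html", "api")


class MethodHolder:
    """Hypothetical stand-in showing the documented method check."""

    def __init__(self) -> None:
        # Assumption: start from 'html', the default used by cpopg_download.
        self.method = "html"

    def set_method(self, method: str) -> None:
        if method not in SUPPORTED_METHODS:
            # The documentation names a plain Exception for invalid values.
            raise Exception(f"Unsupported method: {method!r}")
        self.method = method


holder = MethodHolder()
holder.set_method("api")

try:
    holder.set_method("xml")
    rejected = False
except Exception:
    rejected = True

print(holder.method, rejected)
```

Checking the value before assigning it leaves the previous method in place when an invalid one is passed.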