Client

The tulit Client library supports multiple legal document retrieval sources organized by jurisdiction:

EU Level Clients

Cellar: EU Publications Office SPARQL endpoint for retrieving EU legal documents

Member State Clients

Finland (Finlex): Finnish legal database
France (Legifrance): French legal database
Germany (RIS): German legal information system
Italy (Normattiva): Italian legal database
Luxembourg (Legilux): Luxembourg legal portal
Malta: Maltese legal information
Portugal (DRE): Portuguese official gazette
Spain (BOE): Spanish official gazette
Ireland (Irish Statute Book): Irish legal database

Regional Clients

Veneto: Italian regional legislation (Veneto region)

Base Client

class tulit.client.client.Client(download_dir, log_dir, proxies=None)

Bases: object

A generic document downloader class.

__init__(download_dir, log_dir, proxies=None)

Initializes the downloader with directories for downloads and logs.

Parameters:

download_dir (str) – Directory where downloaded files will be saved.
log_dir (str) – Directory where log files will be saved.

handle_response(response, filename)

Handle a server response by saving or extracting its content.

Parameters:

response (requests.Response) – The HTTP response object.
folder_path (str) – Directory where the file will be saved.
cid (str) – CELLAR ID of the document.

Returns:

Path to the saved file or None if the response couldn’t be processed.

Return type:

str or None

get_extension_from_content_type(content_type)

Map Content-Type to a file extension.

Parameters:: content_type (str) – The Content-Type header from the server response.
Returns:: File extension corresponding to the Content-Type
Return type:: str or None

extract_zip(response, folder_path)

Extracts the content of a zip file.

Parameters:

response (requests.Response) – The HTTP response object.
folder_path (str) – Directory where the zip file will be extracted.

EU Clients

class tulit.client.eu.cellar.CellarClient(download_dir, log_dir, proxies=None)

Bases: Client

send_sparql_query(sparql_query, celex=None)

Sends a SPARQL query to the EU SPARQL endpoint and stores the results in a JSON file.

Parameters:

sparql_query_filepath (str) – The path to the file containing the SPARQL query.
response_file (str) – The path to the file where the results will be stored.

Return type:

None

Raises:

FileNotFoundError – If the SPARQL query file is not found.
Exception – If there is an error sending the query or storing the results.

Notes

This function assumes that the SPARQL query file contains a valid SPARQL query. The results are stored in JSON format.

get_results_table(sparql_query)

Sends a SPARQL query to the EU SPARQL endpoint and returns the results as a JSON object.

Parameters:: sparql_query (str) – The SPARQL query as a string.
Returns:: The results of the SPARQL query in JSON format.
Return type:: dict
Raises:: Exception – If there is an error sending the query or retrieving the results.

Notes

This function uses the SPARQLWrapper library to send the query and retrieve the results. The results are returned in JSON format.

fetch_content(url) → Response

Send a GET request to download a file

Parameters:: url (str) – The URL to send the request to.
Returns:: The response from the server.
Return type:: requests.Response

Notes

The request is sent with the following headers: - Accept: application/zip;mtype=fmx4, application/xml;mtype=fmx4, application/xhtml+xml, text/html, text/html;type=simplified, application/msword, text/plain, application/xml;notice=object - Accept-Language: eng - Content-Type: application/x-www-form-urlencoded - Host: publications.europa.eu - User-Agent: Browser user agent (required by EU server to bypass bot protection)

Raises:: requests.RequestException – If there is an error sending the request.

Member State Clients

Finland (Finlex)

class tulit.client.state.finlex.FinlexClient(download_dir, log_dir, proxies=None)

Bases: Client

Client for retrieving legal documents from the Finlex Open Data REST API. API docs: https://opendata.finlex.fi/finlex/avoindata/v1

BASE_URL = 'https://opendata.finlex.fi/finlex/avoindata/v1'

download(year, number, lang='fi', doc_type='act', fmt='xml'): Download a statute XML from Finlex Open Data API. Example endpoint: /akn/fi/act/statute/2024/123/fin@

France (Legifrance)

class tulit.client.state.legifrance.LegifranceClient(client_id, client_secret, download_dir='./data/france/legifrance', log_dir='./data/logs', proxies=None)

Bases: Client

Client for interacting with the Legifrance API.

The Legifrance API provides access to French legal documents including: - Codes - Laws and decrees (LODA) - Legislative dossiers - Official journals (JORF) - Collective agreements (KALI) - Administrative documents - Case law (JURI) - Parliamentary debates

This client implements the main controllers: - Consult: retrieve specific documents - List: list documents with pagination - Search: search across documents - Suggest: autocomplete suggestions - Chrono: versioned content

get_token()

Obtain OAuth2 token from the Legifrance authentication service.

Returns:: Access token for API requests
Return type:: str

consult_ping() → Dict[str, Any]

Test the consult controller.

Returns:: Ping response
Return type:: dict

consult_code(text_id: str, date: str | None = None, searched_string: str | None = None, sct_cid: str | None = None, abrogated: bool = False, from_suggest: bool = False) → Dict[str, Any]

Get the content of a Code.

Parameters:

text_id (str) – Text identifier (e.g., ‘LEGITEXT000006070721’ for Code Civil)
date (str, optional) – Date for versioned content (format: YYYY-MM-DD)
searched_string (str, optional) – Search string to highlight in the document
sct_cid (str, optional) – Section CID to retrieve specific section
abrogated (bool, optional) – Include abrogated versions (default: False)
from_suggest (bool, optional) – Indicates if request comes from suggest (default: False)

Returns:

Code content

Return type:

Client

Base Client

EU Clients

Member State Clients

Finland (Finlex)

France (Legifrance)

Germany (RIS)

Ireland

Italy (Normattiva)

Luxembourg (Legilux)

Malta

Portugal (DRE)

Spain (BOE)

Regional Clients

Veneto