QCPortal API Reference#
- class PortalClient(address, username=None, password=None, verify=True, show_motd=True, *, cache_dir=None, cache_max_size=0)[source]#
Bases:
PortalClientBase
Main class for interacting with a QCArchive server
- Parameters:
address (str) – The host or IP address of the FractalServer instance, including protocol and port if necessary (”https://ml.qcarchive.molssi.org”, “http://192.168.1.10:8888”)
username (Optional[str]) – The username to authenticate with.
password (Optional[str]) – The password to authenticate with.
verify (bool) – Verifies the SSL connection with a third party server. This may be False if a FractalServer was not provided an SSL certificate and defaults back to self-signed SSL keys.
show_motd (bool) – If a Message-of-the-Day is available, display it
cache_dir (Optional[str]) – Directory to store an internal cache of records and other data
cache_max_size (int) – Maximum size of the cache directory
- get_server_information()[source]#
Request general information about the server
- Returns:
Server information.
- Return type:
Dict[str, Any]
- set_motd(new_motd)[source]#
Sets the Message-of-the-Day (MOTD) on the server
- Parameters:
new_motd (str)
- Return type:
str
- query_dataset_records(record_id, dataset_type=None)[source]#
- Parameters:
record_id (int | Iterable[int])
dataset_type (Iterable[str] | None)
- create_dataset_view(dataset_id, file_path, include=None, overwrite=False)[source]#
- Parameters:
dataset_id (int)
file_path (str)
include (Iterable[str] | None)
overwrite (bool)
- get_dataset_status_by_id(dataset_id)[source]#
- Parameters:
dataset_id (int)
- Return type:
Dict[str, Dict[RecordStatusEnum, int]]
- add_dataset(dataset_type, name, description=None, tagline=None, tags=None, group=None, provenance=None, visibility=True, default_tag='*', default_priority=PriorityEnum.normal, metadata=None, owner_group=None, existing_ok=False)[source]#
- Parameters:
dataset_type (str)
name (str)
description (str | None)
tagline (str | None)
tags (List[str] | None)
group (str | None)
provenance (Dict[str, Any] | None)
visibility (bool)
default_tag (str)
default_priority (PriorityEnum)
metadata (Dict[str, Any] | None)
owner_group (str | None)
existing_ok (bool)
- Return type:
- delete_dataset(dataset_id, delete_records)[source]#
- Parameters:
dataset_id (int)
delete_records (bool)
- download_external_file(file_id, destination_path, overwrite=False)[source]#
Downloads an external file to the given path
The file size and checksum will be checked against the metadata stored on the server
- Parameters:
file_id (int) – ID of the file to obtain
destination_path (str) – Full path to the destination file (including filename)
overwrite (bool) – If True, allow for overwriting an existing file. If False, and a file already exists at the given destination path, an exception will be raised.
- Returns:
A tuple of file size and sha256 checksum.
- Return type:
Tuple[int, str]
- get_molecules(molecule_ids, missing_ok=False)[source]#
Obtains molecules with the specified IDs from the server
- Parameters:
molecule_ids (int | Sequence[int]) – A single molecule ID, or a list (or other sequence) of molecule IDs
missing_ok (bool) – If set to True, then missing molecules will be tolerated, and the returned list of Molecules will contain None for the corresponding IDs that were not found.
- Returns:
Molecules, in the same order as the requested ids. If given a sequence of ids, the return value will be a list. Otherwise, it will be a single Molecule.
- Return type:
Molecule | None | List[Molecule | None]
- query_molecules(*, molecule_hash=None, molecular_formula=None, identifiers=None, limit=None)[source]#
Query molecules by attributes.
Do not rely on the returned molecules being in any particular order.
- Parameters:
molecule_hash (str | Iterable[str] | None) – Queries molecules by hash
molecular_formula (str | Iterable[str] | None) – Queries molecules by molecular formula. Molecular formulas are not order-sensitive (e.g. “H2O == OH2 != Oh2”).
identifiers (Dict[str, str | Iterable[str]] | None) – Additional identifiers to search for (smiles, etc)
limit (int | None) – The maximum number of Molecules to return. Note that the server limit is always obeyed.
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_molecules(molecules)[source]#
Add molecules to the server database
If the same molecule (defined by having the same hash) already exists, then the existing molecule is kept and that particular molecule is not added.
- Parameters:
molecules (Sequence[Molecule]) – A list of Molecules to add to the server.
- Returns:
Metadata about what was inserted, and a list of IDs of the molecules in the same order as the molecules parameter.
- Return type:
Tuple[InsertMetadata, List[int]]
- modify_molecule(molecule_id, name=None, comment=None, identifiers=None, overwrite_identifiers=False)[source]#
Modify molecules on the server
This is only capable of updating the name, comment, and identifiers fields (except molecule_hash and molecular formula).
If a molecule with that id does not exist, an exception is raised
- Parameters:
molecule_id (int) – ID of the molecule to modify
name (str | None) – New name for the molecule. If None, name is not changed.
comment (str | None) – New comment for the molecule. If None, comment is not changed
identifiers (Dict[str, Any] | Identifiers | None) – A new set of identifiers for the molecule
overwrite_identifiers (bool) – If True, the identifiers of the molecule are set to be those given exactly (ie, identifiers that exist in the DB but not in the new set will be removed). Otherwise, the new set of identifiers is merged into the existing ones. Note that molecule_hash and molecular_formula are never removed.
- Returns:
Metadata about the modification/update.
- Return type:
- delete_molecules(molecule_ids)[source]#
Deletes molecules from the server
This will not delete any molecules that are in use
- Parameters:
molecule_ids (int | Sequence[int]) – An id or list of ids to query.
- Returns:
Metadata about what was deleted
- Return type:
- get_records(record_ids, missing_ok=False, *, include=None)[source]#
Obtain records of all types with specified IDs
This function will return record objects of the given ID no matter what the type is. All records are unique by ID (ie, an optimization will never have the same ID as a singlepoint).
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
A list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
List[BaseRecord | None] | BaseRecord | None
- query_records(*, record_id=None, record_type=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Query records of all types based on common fields
This is a general query of all record types, so it can only filter by fields that are common among all records.
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
record_type (str | Iterable[str] | None) – Query records whose type is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list (usernames or IDs)
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list (group names or IDS)
limit (int) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- reset_records(record_ids)[source]#
Resets running or errored records to be waiting again
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- cancel_records(record_ids)[source]#
Marks running, waiting, or errored records as cancelled
A cancelled record will not be picked up by a manager.
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- invalidate_records(record_ids)[source]#
Marks a completed record as invalid
An invalid record is one that supposedly successfully completed. However, after review, is not correct.
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- delete_records(record_ids, soft_delete=True, delete_children=True)[source]#
Delete records from the database
If soft_delete is True, then the record is just marked as deleted and actually deletion may happen later. Soft delete can be undone with undelete
- Parameters:
record_ids (int | Sequence[int]) – Reset the status of these record ids
soft_delete – Don’t actually delete the record, just mark it for later deletion
delete_children (bool) – If True, attempt to delete child records as well
- Return type:
- uninvalidate_records(record_ids)[source]#
Undo the invalidation of records
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- uncancel_records(record_ids)[source]#
Undo the cancellation of records
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- undelete_records(record_ids)[source]#
Undo the (soft) deletion of records
- Parameters:
record_ids (int | Sequence[int])
- Return type:
- modify_records(record_ids, new_tag=None, new_priority=None)[source]#
Modify the tag or priority of a record
- Parameters:
record_ids (int | Sequence[int])
new_tag (str | None)
new_priority (PriorityEnum | None)
- Return type:
- add_comment(record_ids, comment)[source]#
Adds a comment to records
- Parameters:
record_ids (int | Sequence[int]) – The record or records to add the comments to
comment (str) – The comment string to add. Your username will be added automatically
- Returns:
Metadata about which records were updated
- Return type:
- get_waiting_reason(record_id)[source]#
Get the reason a record is in the waiting status
The return is a dictionary, with a ‘reason’ key containing the overall reason the record is waiting. If appropriate, there is a ‘details’ key that contains information for each active compute manager on why that manager is not able to pick up the record’s task.
- Parameters:
record_id (int) – The record ID to test
- Returns:
A dictionary containing information about why the record is not being picked up by compute managers
- Return type:
Dict[str, Any]
- add_singlepoints(molecules, program, driver, method, basis, keywords=None, protocols=None, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new singlepoint computations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per molecule.
- Parameters:
molecules (int | Molecule | List[int | Molecule]) – The Molecules or Molecule ids to compute with the above methods
program (str) – The computational program to execute the result with (e.g., “rdkit”, “psi4”).
driver (str) – The primary result that the compute will acquire {“energy”, “gradient”, “hessian”, “properties”}
method (str) – The computational method to use (e.g., “B3LYP”, “PBE”)
basis (str | None) – The basis to apply to the computation (e.g., “cc-pVDZ”, “6-31G”)
keywords (Dict[str, Any] | None) – The program-specific keywords for the computation
protocols (AtomicResultProtocols | Dict[str, Any] | None) – Protocols for storing more/less data for each computation
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_singlepoints(record_ids, missing_ok=False, *, include=None)[source]#
Obtain singlepoint records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
SinglepointRecord | None | List[SinglepointRecord | None]
- query_singlepoints(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, driver=None, method=None, basis=None, keywords=None, molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries singlepoint records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose program is in the given list
driver (SinglepointDriver | Iterable[SinglepointDriver] | None) – Query records whose driver is in the given list
method (str | Iterable[str] | None) – Query records whose method is in the given list
basis (str | Iterable[str | None] | None) – Query records whose basis is in the given list
keywords (Dict[str, Any] | Iterable[Dict[str, Any]] | None) – Query records with these keywords (exact match)
molecule_id (int | Iterable[int] | None) – Query records whose molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_optimizations(initial_molecules, program, qc_specification, keywords=None, protocols=None, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new geometry optimization calculations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per initial molecule.
- Parameters:
initial_molecules (int | Molecule | List[int | Molecule]) – Initial molecule/geometry to optimize
program (str) – Which program to use for the optimization (ie, geometric)
qc_specification (QCSpecification) – The method, basis, etc, to optimize the geometry with
keywords (Dict[str, Any] | None) – Program-specific keywords for the optimization program (not the qc program)
protocols (OptimizationProtocols | None) – Protocols for storing more/less data for each computation (for the optimization)
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_optimizations(record_ids, missing_ok=False, *, include=None)[source]#
Obtain optimization records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
OptimizationRecord | None | List[OptimizationRecord | None]
- query_optimizations(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, qc_program=None, qc_method=None, qc_basis=None, initial_molecule_id=None, final_molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries optimization records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (singlepoint calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose optimization program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose method is in the given list
qc_basis (str | Iterable[str | None] | None) – Query records whose basis is in the given list
initial_molecule_id (int | Iterable[int] | None) – Query records whose initial molecule (id) is in the given list
final_molecule_id (int | Iterable[int] | None) – Query records whose final molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_torsiondrives(initial_molecules, program, optimization_specification, keywords, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new torsiondrive computations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per set of molecules
- Parameters:
initial_molecules (List[List[int | Molecule]]) – Molecules to start the torsiondrives. Each torsiondrive can start with multiple molecules, so this is a nested list
program (str) – The program to run the torsiondrive computation with (“torsiondrive”)
optimization_specification (OptimizationSpecification) – Specification of how each optimization of the torsiondrive should be run
keywords (TorsiondriveKeywords | Dict[str, Any]) – The torsiondrive keywords for the computation
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_torsiondrives(record_ids, missing_ok=False, *, include=None)[source]#
Obtain torsiondrive records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
TorsiondriveRecord | None | List[TorsiondriveRecord | None]
- query_torsiondrives(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, optimization_program=None, qc_program=None, qc_method=None, qc_basis=None, initial_molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries torsiondrive records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (optimization calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose torsiondrive program is in the given list
optimization_program (str | Iterable[str] | None) – Query records whose optimization program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose method is in the given list
qc_basis (str | Iterable[str] | None) – Query records whose basis is in the given list
initial_molecule_id (int | Iterable[int] | None) – Query records whose initial molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_gridoptimizations(initial_molecules, program, optimization_specification, keywords, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new gridoptimization computations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per initial molecule
- Parameters:
initial_molecules (int | Molecule | Sequence[int | Molecule]) –
- Molecules to start the gridoptimizations. Each gridoptimization starts with
a single molecule.
program (str) – The program to run the gridoptimization computation with (“gridoptimization”)
optimization_specification (OptimizationSpecification) – Specification of how each optimization of the gridoptimization should be run
keywords (GridoptimizationKeywords | Dict[str, Any]) – The gridoptimization keywords for the computation
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_gridoptimizations(record_ids, missing_ok=False, *, include=None)[source]#
Obtain gridoptimization records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
GridoptimizationRecord | None | List[GridoptimizationRecord | None]
- query_gridoptimizations(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, optimization_program=None, qc_program=None, qc_method=None, qc_basis=None, initial_molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries gridoptimization records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (optimization calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose gridoptimization program is in the given list
optimization_program (str | Iterable[str] | None) – Query records whose optimization program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose method is in the given list
qc_basis (str | Iterable[str | None] | None) – Query records whose basis is in the given list
initial_molecule_id (int | Iterable[int] | None) – Query records whose initial molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_reactions(stoichiometries, program, singlepoint_specification, optimization_specification, keywords, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new reaction computations to the server
Reactions can have a singlepoint specification, optimization specification, or both; at least one must be specified. If both are specified, an optimization is done, followed by a singlepoint computation. Otherwise, only the specification that is specified is used.
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per set of molecules
- Parameters:
stoichiometries (Sequence[Sequence[Sequence[float, Union[int, Molecule]]]]) – Coefficients and molecules of the reaction. Each reaction has multiple molecules/coefficients, so this is a nested list
program (str) – The program for running the reaction computation (“reaction”)
singlepoint_specification (Optional[QCSpecification]) – The specification for singlepoint energy calculations
optimization_specification (Optional[OptimizationSpecification]) – The specification for optimization calculations
keywords (ReactionKeywords) – The keywords for the reaction calculation/service
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (Optional[str]) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_reactions(record_ids, missing_ok=False, *, include=None)[source]#
Obtain reaction records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
ReactionRecord | None | List[ReactionRecord | None]
- query_reactions(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, optimization_program=None, qc_program=None, qc_method=None, qc_basis=None, molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries reaction records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (optimization calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose reaction program is in the given list
optimization_program (Iterable[str | None] | None) – Query records whose optimization program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose method is in the given list
qc_basis (str | Iterable[str] | None) – Query records whose basis is in the given list
molecule_id (int | Iterable[int] | None) – Query reactions that contain a molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_manybodys(initial_molecules, program, levels, bsse_correction, keywords, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds new manybody expansion computations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per initial molecule.
- Parameters:
initial_molecules (Sequence[int | Molecule]) – Initial molecules for the manybody expansion. Must have > 1 fragments.
program (str) – The program to run the manybody computation with (“manybody”)
singlepoint_specification – Specification for the singlepoint calculations done in the expansion
keywords (ManybodyKeywords | Dict[str, Any]) – The keywords for the manybody program
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
levels (Dict[int | Literal['supersystem'], ~qcportal.singlepoint.record_models.QCSpecification])
bsse_correction (BSSECorrectionEnum | Sequence[BSSECorrectionEnum])
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_manybodys(record_ids, missing_ok=False, *, include=None)[source]#
Obtain manybody records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
ManybodyRecord | None | List[ManybodyRecord | None]
- query_manybodys(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, qc_program=None, qc_method=None, qc_basis=None, initial_molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries reaction records on the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (optimization calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose reaction program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose qc method is in the given list
qc_basis (str | Iterable[str] | None) – Query records whose qc basis is in the given list
initial_molecule_id (int | Iterable[int] | None) – Query manybody calculations that contain an initial molecule (id) is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- add_nebs(initial_chains, program, singlepoint_specification, optimization_specification, keywords, tag='*', priority=PriorityEnum.normal, owner_group=None, find_existing=True)[source]#
Adds neb calculations to the server
This checks if the calculations already exist in the database. If so, it returns the existing id, otherwise it will insert it and return the new id.
This will add one record per set of molecules
- Parameters:
initial_chains (List[List[int | Molecule]]) – The initial chains to run the NEB calculations on . Each NEB calculation starts with a single chain (list of molecules), so this is a nested list
program (str) – The program to run the neb computation with (“geometric”)
singlepoint_specification (QCSpecification) – Specification of how each singlepoint (gradient/hessian) should be run
optimization_specification (OptimizationSpecification | None) – Specification of how any optimizations of the torsiondrive should be run
keywords (NEBKeywords | Dict[str, Any]) – The torsiondrive keywords for the computation
tag (str) – The tag for the task. This will assist in routing to appropriate compute managers.
priority (PriorityEnum) – The priority of the job (high, normal, low). Default is normal.
owner_group (str | None) – Group with additional permission for these records
find_existing (bool) – If True, search for existing records and return those. If False, always add new records
- Returns:
Metadata about the insertion, and a list of record ids. The ids will be in the order of the input molecules
- Return type:
Tuple[InsertMetadata, List[int]]
- get_nebs(record_ids, missing_ok=False, *, include=None)[source]#
Obtain NEB records with the specified IDs.
Records will be returned in the same order as the record ids.
- Parameters:
record_ids (int | Sequence[int]) – Single ID or sequence/list of records to obtain
missing_ok (bool) – If set to True, then missing records will be tolerated, and the returned records will contain None for the corresponding IDs that were not found.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
If a single ID was specified, returns just that record. Otherwise, returns a list of records. If missing_ok was specified, None will be substituted for a record that was not found.
- Return type:
- query_nebs(*, record_id=None, manager_name=None, status=None, dataset_id=None, parent_id=None, child_id=None, created_before=None, created_after=None, modified_before=None, modified_after=None, program=None, qc_program=None, qc_method=None, qc_basis=None, molecule_id=None, owner_user=None, owner_group=None, limit=None, include=None)[source]#
Queries neb records from the server
Do not rely on the returned records being in any particular order.
- Parameters:
record_id (int | Iterable[int] | None) – Query records whose ID is in the given list
manager_name (str | Iterable[str] | None) – Query records that were completed (or are currently runnning) on a manager is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Query records whose status is in the given list
dataset_id (int | Iterable[int] | None) – Query records that are part of a dataset is in the given list
parent_id (int | Iterable[int] | None) – Query records that have a parent is in the given list
child_id (int | Iterable[int] | None) – Query records that have a child (optimization calculation) is in the given list
created_before (datetime | str | None) – Query records that were created before the given date/time
created_after (datetime | str | None) – Query records that were created after the given date/time
modified_before (datetime | str | None) – Query records that were modified before the given date/time
modified_after (datetime | str | None) – Query records that were modified after the given date/time
program (str | Iterable[str] | None) – Query records whose torsiondrive program is in the given list
qc_program (str | Iterable[str] | None) – Query records whose qc program is in the given list
qc_method (str | Iterable[str] | None) – Query records whose method is in the given list
qc_basis (str | Iterable[str] | None) – Query records whose basis is in the given list
molecule_id (int | Iterable[int] | None) – Query records whose initial chains contain a molecule (id) that is in the given list
owner_user (int | str | Iterable[int | str] | None) – Query records owned by a user in the given list
owner_group (int | str | Iterable[int | str] | None) – Query records owned by a group in the given list
limit (int | None) – The maximum number of records to return. Note that the server limit is always obeyed.
include (Iterable[str] | None) – Additional fields to include in the returned record
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- get_managers(names, missing_ok=False)[source]#
Obtain manager information from the server with the specified names
- Parameters:
names (str | Sequence[str]) – A manager name or list of names
missing_ok (bool) – If set to True, then missing managers will be tolerated, and the returned managers will contain None for the corresponding managers that were not found.
- Returns:
If a single name was specified, returns just that manager. Otherwise, returns a list of managers. If missing_ok was specified, None will be substituted for a manager that was not found.
- Return type:
ComputeManager | None | List[ComputeManager | None]
- query_managers(*, manager_id=None, name=None, cluster=None, hostname=None, status=None, modified_before=None, modified_after=None, limit=None)[source]#
Queries manager information on the server
- Parameters:
manager_id (int | Iterable[int] | None) – ID assigned to the manager (this is not the UUID. This should be used very rarely).
name (str | Iterable[str] | None) – Queries managers whose name is in the given list
cluster (str | Iterable[str] | None) – Queries managers whose assigned cluster is in the given list
hostname (str | Iterable[str] | None) – Queries managers whose hostname is in the given list
status (RecordStatusEnum | Iterable[RecordStatusEnum] | None) – Queries managers whose status is in the given list
modified_before (datetime | str | None) – Query for managers last modified before a certain time
modified_after (datetime | str | None) – Query for managers last modified after a certain time
limit (int | None) – The maximum number of managers to return. Note that the server limit is always obeyed.
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
ManagerQueryIterator
- query_access_log(*, module=None, method=None, before=None, after=None, user=None, limit=None)[source]#
Query the server access log
This log contains information about who accessed the server, and when.
- Parameters:
module (str | Iterable[str] | None) – Return log entries whose module is in the given list
method (str | Iterable[str] | None) – Return log entries whose access_method is in the given list
before (datetime | str | None) – Return log entries captured before the specified date/time
after (datetime | str | None) – Return log entries captured after the specified date/time
user (int | str | Iterable[int | str] | None) – User name or ID associated with the log entry
limit (int | None) – The maximum number of log entries to return. Note that the server limit is always obeyed.
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
AccessLogQueryIterator
- delete_access_log(before)[source]#
Delete access log entries from the server
- Parameters:
before (datetime) – Delete access log entries captured before the given date/time
- Returns:
The number of access log entries deleted from the server
- Return type:
int
- query_error_log(*, error_id=None, user=None, before=None, after=None, limit=None)[source]#
Query the server’s internal error log
This log contains internal errors that are not always passed to the user.
- Parameters:
error_id (int | Iterable[int] | None) – Return error log entries whose id is in the list
user (int | str | Iterable[int | str] | None) – Return error log entries whose user name or ID is in the list
before (datetime | str | None) – Return error log entries captured before the specified date/time
after (datetime | str | None) – Return error log entries captured after the specified date/time
limit (int | None) – The maximum number of log entries to return. Note that the server limit is always obeyed.
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
ErrorLogQueryIterator
- delete_error_log(before)[source]#
Delete error log entries from the server
- Parameters:
before (datetime) – Delete error log entries captured before the given date/time
- Returns:
The number of error log entries deleted from the server
- Return type:
int
- get_internal_job(job_id)[source]#
Gets information about an internal job on the server
- Parameters:
job_id (int)
- Return type:
- query_internal_jobs(*, job_id=None, name=None, user=None, runner_hostname=None, status=None, last_updated_before=None, last_updated_after=None, added_before=None, added_after=None, scheduled_before=None, scheduled_after=None, limit=None)[source]#
Queries the internal job queue on the server
- Parameters:
job_id (Optional[int, Iterable[int]]) – ID assigned to the job
name (Optional[Union[str, Iterable[str]]]) – Queries jobs whose name is in the given list
user (Optional[Union[int, str, Iterable[Union[int, str]]]]) – User name or ID associated with the log entry
runner_hostname (Optional[Union[str, Iterable[str]]]) – Queries jobs that were run/are running on a given host
status (Optional[Union[InternalJobStatusEnum, Iterable[InternalJobStatusEnum]]]) – Queries jobs whose status is in the given list
last_updated_before (Optional[Union[datetime, str]]) – Query for jobs last updated before a certain time
last_updated_after (Optional[Union[datetime, str]]) – Query for jobs last updated after a certain time
added_before (Optional[Union[datetime, str]]) – Query for jobs added before a certain time
added_after (Optional[Union[datetime, str]]) – Query for jobs added after a certain time
scheduled_before (Optional[Union[datetime, str]]) – Query for jobs scheduled to run before a certain time
scheduled_after (Optional[Union[datetime, str]]) – Query for jobs scheduled to run after a certain time
limit (Optional[int]) – The maximum number of jobs to return. Note that the server limit is always obeyed.
- Returns:
An iterator that can be used to retrieve the results of the query
- Return type:
- cancel_internal_job(job_id)[source]#
Cancels (to the best of our ability) an internal job
- Parameters:
job_id (int)
- query_access_summary(*, group_by='day', before=None, after=None)[source]#
Obtains summaries of access data
This aggregate data is created on the server, so you don’t need to download all the log entries and do it yourself.
- Parameters:
group_by (str) – How to group the data. Valid options are “user”, “hour”, “day”, “country”, “subdivision”
before (datetime | str | None) – Query for log entries with a timestamp before a specific time
after (datetime | str | None) – Query for log entries with a timestamp after a specific time
- Return type:
AccessLogSummary
- get_role(rolename)[source]#
Get information about a role on the server
- Parameters:
rolename (str)
- Return type:
RoleInfo
- add_role(role_info)[source]#
Adds a role with permissions to the server
If not successful, an exception is raised.
- Parameters:
role_info (RoleInfo)
- Return type:
None
- modify_role(role_info)[source]#
Modifies the permissions of a role on the server
If not successful, an exception is raised.
- Returns:
A copy of the role as it now appears on the server
- Parameters:
role_info (RoleInfo)
- Return type:
RoleInfo
- delete_role(rolename)[source]#
Deletes a role from the server
This will not delete any role to which a user is assigned
Will raise an exception on error
- Parameters:
rolename (str) – Name of the role to delete
- Return type:
None
- get_group(groupname_or_id)[source]#
Get information about a group on the server
- Parameters:
groupname_or_id (int | str)
- Return type:
GroupInfo
- add_group(group_info)[source]#
Adds a group with permissions to the server
If not successful, an exception is raised.
- Parameters:
group_info (GroupInfo)
- Return type:
None
- delete_group(groupname_or_id)[source]#
Deletes a group on the server
Deleted groups will be removed from all users groups list
- Parameters:
groupname_or_id (int | str)
- get_user(username_or_id=None)[source]#
Get information about a user on the server
If the username is not supplied, then info about the currently logged-in user is obtained.
- Parameters:
username_or_id (int | str | None) – The username or ID to get info about
- Returns:
Information about the user
- Return type:
UserInfo
- add_user(user_info, password=None)[source]#
Adds a user to the server
- Parameters:
user_info (UserInfo) – Info about the user to add
password (str | None) – The user’s password. If None, then one will be generated
- Returns:
The password of the user (either the same as the supplied password, or the server-generated one)
- Return type:
str
- modify_user(user_info)[source]#
Modifies a user on the server
The user is determined by the id field of the input UserInfo, although the id and username are checked for consistency.
Depending on the current user’s permissions, some fields may not be updatable.
- Parameters:
user_info (UserInfo) – Updated information for a user
- Returns:
The updated user information as it appears on the server
- Return type:
UserInfo
- change_user_password(username_or_id=None, new_password=None)[source]#
Change a users password
If the username is not specified, then the current logged-in user is used.
If the password is not specified, then one is automatically generated by the server.
- Parameters:
username_or_id (int | str | None) – The name or ID of the user whose password to change. If None, then use the currently logged-in user
new_password (str | None) – Password to change to. If None, let the server generate one.
- Returns:
The new password (either the same as the supplied one, or the server generated one
- Return type:
str
- delete_user(username_or_id)[source]#
Delete a user from the server
- Parameters:
username_or_id (int | str)
- Return type:
None
- download_file(endpoint, destination_path, overwrite=False)#
- Parameters:
endpoint (str)
destination_path (str)
overwrite (bool)
- Return type:
Tuple[int, str]
- property encoding: str#
- classmethod from_env()#
Creates a new client given information stored in environment variables
The environment variables are:
QCPORTAL_ADDRESS (required)
QCPORTAL_USERNAME (optional)
QCPORTAL_PASSWORD (optional)
QCPORTAL_VERIFY (optional, defaults to True)
QCPORTAL_CACHE_DIR (optional)
- classmethod from_file(server_name=None, config_path=None)#
Creates a new client given information in a file.
If no path is passed in, the current working directory and finally ~/.qca are searched for “qcportal_config.yaml”
- Parameters:
server_name (str | None) – Name/alias of the server in the yaml file
config_path (str | None) – Full path to a configuration file, or a directory containing “qcportal_config.yaml”.
- make_request(method, endpoint, response_model, *, body_model=None, url_params_model=None, body=None, url_params=None, allow_retries=True)#
- Parameters:
method (str)
endpoint (str)
response_model (Type[_V] | None)
body_model (Type[_T] | None)
url_params_model (Type[_U] | None)
body (_T | Dict[str, Any] | None)
url_params (_U | Dict[str, Any] | None)
allow_retries (bool)
- Return type:
_V
- ping()#
Pings the server to see if it is up
- Returns:
True if the server is up and responded to the ping. False otherwise
- Return type:
bool
- class MoleculeQueryIterator(client, query_filters)[source]#
Bases:
QueryIteratorBase
[Molecule
]Iterator for molecule queries
This iterator transparently handles batching and pagination over the results of a molecule query.
Construct an iterator
- Parameters:
client – QCPortal client object used to contact/retrieve data from the server
query_filters (MoleculeQueryFilters) – The actual query information to send to the server
- reset()#
Starts retrieval of results from the beginning again