Follow the PBS Pro Design Document Guidelines.
Link to discussion on Developer Forum: <http://community.pbspro.org/t/design-for-refactoring-pbs-database-code/2009>
Link to issue: <issue link if available>
Link to pull request: <PR link if available>

To improve maintainability, the database-related code currently inside the PBS server needs to be refactored into a separate, pluggable dynamic library (Libdb). This design page describes the APIs through which the PBS server talks to Libdb, which in turn talks to the database. A further motivation for this refactoring is that PBS can then move to a different database without major code changes within PBS itself.

1.1. Files: List of files PBS will need to have database support.

PBS_EXEC/lib/libdb.so

Description: This dynamic library will have the functionality for the PBS server to access the database.

...

Usage: PBS_EXEC/libexec/pbs_db_utility <install_db | upgrade_db | init_db_env>

Parameters:

install_db: Create and initialize the dataservice instance.

upgrade_db: Upgrade the database.

PBS_EXEC/libexec/pbs_db_env

...

Description: This script will initialize the environment with information related to the database installation path.

PBS_EXEC/include/pbs_db.h

Description: This header file will provide declarations for APIs and argument structures listed below.

conn_db_handle: Pointer used to maintain the database connection information. All elements of the structure it points to are generic and are not bound to any particular database.

Code Block
struct pbs_db_connection {
      void    *conn_db_handle;      /* opaque database handle  */
      int     conn_state;           /* Database connection status */
      int     conn_db_state;        /* Database states, db up? down? starting? */
};
typedef struct pbs_db_connection pbs_db_conn_t;

/* Values for conn_state */
#define PBS_DB_CONNECT_STATE_NOT_CONNECTED  1 /* Not connected to the db */
#define PBS_DB_CONNECT_STATE_CONNECTING	    2 /* Connecting to the db */
#define PBS_DB_CONNECT_STATE_CONNECTED      3 /* Connected to the db */
#define PBS_DB_CONNECT_STATE_FAILED         4 /* Connection to db failed */

/* Values for conn_db_state */
#define PBS_DB_DOWN		1  /* Database is down */
#define PBS_DB_STARTING		2  /* Database is starting */
#define PBS_DB_STARTED		3  /* Database is up */

...

pbs_db_obj_info_t: Wrapper object structure. It contains a pointer to one of the several database structures. Most of the database manipulation/query functions take this structure as a parameter. Depending on the contained structure type, an appropriate internal database manipulation/query function is eventually called. This allows keeping the interface simpler and generic.

Code Block
struct pbs_db_obj_info {
  int 	pbs_db_obj_type; /* identifies the contained object type */
  union {
	pbs_db_job_info_t	*pbs_db_job; 	/* map database job structure to C */
	pbs_db_jobscr_info_t	*pbs_db_jobscr;	/* map database job script to C */
	pbs_db_resv_info_t	*pbs_db_resv;	/* map database resv structure to C */
	pbs_db_svr_info_t	*pbs_db_svr;	/* map database server structure to C */
	pbs_db_que_info_t	*pbs_db_que;	/* map database queue structure to C */
	pbs_db_node_info_t	*pbs_db_node;	/* map database node structure to C */
	pbs_db_sched_info_t	*pbs_db_sched;	/* map database scheduler structure to C */	
	pbs_db_mominfo_time_t	*pbs_db_mominfo_tm; /* map database mominfo_time structure to C */
  } pbs_db_un;
};
typedef struct pbs_db_obj_info pbs_db_obj_info_t;

/* Structure used to map database job structure to C */
struct pbs_db_job_info {
	char     ji_jobid[PBS_MAXSVRJOBID + 1]; /* job identifier */
	INTEGER  ji_state;    	/* internal copy of state */
	INTEGER  ji_substate; 	/* job sub-state */
	INTEGER  ji_svrflags; 	/* server flags */
	INTEGER  ji_numattr;  	/* not used */
	INTEGER  ji_ordering; 	/* special scheduling ordering */
	INTEGER  ji_priority; 	/* internal priority */
	BIGINT   ji_stime;    	/* time job started execution */
	BIGINT   ji_endtBdry; 	/* estimate upper bound on end time */
	char     ji_queue[PBS_MAXQUEUENAME + 1]; /* name of current queue */
	char     ji_destin[PBS_MAXROUTEDEST + 1]; /* dest from qmove/route */
	INTEGER  ji_un_type;  	/* job's queue type */
	INTEGER  ji_momaddr;  	/* host addr of Server */
	INTEGER  ji_momport;  	/* port # */
	INTEGER  ji_exitstat; 	/* job exit status from MOM */
	BIGINT   ji_quetime;  	/* time entered queue */
	BIGINT   ji_rteretry; 	/* route retry time */
	INTEGER  ji_fromsock; 	/* socket job coming over */
	BIGINT   ji_fromaddr; 	/* host job coming from */
	char     ji_4jid[8];  	/* extended job save data */
	char     ji_4ash[8];  	/* extended job save data */
	INTEGER  ji_credtype; 	/* credential type */
	BIGINT   ji_qrank;    	/* sort key for db query */
	pbs_db_attr_list_t db_attr_list; /* list of attributes for database */
};
typedef struct pbs_db_job_info pbs_db_job_info_t;

/* Structure used to map database resv structure to C */
struct pbs_db_resv_info {
	char    ri_resvid[PBS_MAXSVRJOBID + 1]; /* reservation identifier */
	char    ri_queue[PBS_MAXQUEUENAME + 1]; /* queue used by reservation */
	INTEGER ri_state;	/* internal copy of state */
	INTEGER ri_substate;	/* substate of resv state */
	INTEGER ri_type;	/* "reservation" or "reservation job" */
	BIGINT  ri_stime;	/* left window boundary */
	BIGINT  ri_etime;	/* right window boundary */
	BIGINT  ri_duration;	/* reservation duration */
	INTEGER ri_tactive;	/* time reservation became active */
	INTEGER ri_svrflags;	/* server flags */
	INTEGER ri_numattr;	/* not used */
	INTEGER ri_resvTag;	/* not used */
	INTEGER ri_un_type;	/* not used */
	INTEGER ri_fromsock;	/* resv from sock */
	BIGINT  ri_fromaddr;	/* resv from sock addr */
	pbs_db_attr_list_t db_attr_list; /* list of attributes */
};
typedef struct pbs_db_resv_info pbs_db_resv_info_t;

/* Structure used to map database server structure to C */
struct pbs_db_svr_info {
	BIGINT  sv_jobidnumber;	/* next number to use in new jobid */
	pbs_db_attr_list_t db_attr_list; /* list of attributes */
};
typedef struct pbs_db_svr_info pbs_db_svr_info_t;

/* Structure used to map database scheduler structure to C */
struct pbs_db_sched_info {
	char    sched_name[PBS_MAXSCHEDNAME+1]; /* sched name */
	pbs_db_attr_list_t db_attr_list; 	/* list of attributes */
};
typedef struct pbs_db_sched_info pbs_db_sched_info_t;

/* Structure used to map database queue structure to C */
struct pbs_db_que_info {
	char    qu_name[PBS_MAXQUEUENAME +1]; 	/* queue name */
	INTEGER qu_type;			/* queue type: exec, route */
	pbs_db_attr_list_t db_attr_list; 	/* list of attributes */
};
typedef struct pbs_db_que_info pbs_db_que_info_t;

/* Structure used to map database node structure to C */
struct pbs_db_node_info {
	char	nd_name[PBS_MAXSERVERNAME+1]; /* vnode's name */
	INTEGER nd_index;	/* global node index */
	BIGINT	mom_modtime; 	/* node config update time */
	char	nd_hostname[PBS_MAXSERVERNAME+1]; /* node hostname */
	INTEGER nd_state;	/* state of node */
	INTEGER nd_ntype;	/* node type */
	char	nd_pque[PBS_MAXSERVERNAME+1]; /* queue to which it belongs */
	pbs_db_attr_list_t db_attr_list; /* list of attributes */
};
typedef struct pbs_db_node_info pbs_db_node_info_t;

/* Structure used to map database mominfo_time structure to C */
struct pbs_db_mominfo_time {
	BIGINT	mit_time; /* time of the host to vnode map */
	INTEGER mit_gen;  /* generation of the host to vnode map */
};
typedef struct pbs_db_mominfo_time pbs_db_mominfo_time_t;

/* Structure used to map database job script to C */
struct pbs_db_jobscr_info {
	char     ji_jobid[PBS_MAXSVRJOBID + 1]; /* job identifier */
	TEXT     script;	/* job script */
};
typedef struct pbs_db_jobscr_info pbs_db_jobscr_info_t;
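The wrapper-object idea described for pbs_db_obj_info_t can be illustrated with a minimal sketch. Everything prefixed with demo_ or DEMO_ is hypothetical and exists only for this illustration: the stand-in structures are heavily reduced, and the tag values are not the real PBS_DB_* constants. The sketch only shows how a single generic entry point can inspect the type tag and forward to a type-specific routine.

```c
#include <stddef.h>
#include <stdio.h>

/* Hypothetical type tags; the real PBS_DB_* values are defined by libdb. */
enum { DEMO_DB_JOB = 0, DEMO_DB_QUEUE = 1 };

/* Reduced stand-ins for pbs_db_job_info_t and pbs_db_que_info_t. */
typedef struct { char ji_jobid[32]; int ji_state; } demo_job_info_t;
typedef struct { char qu_name[32];  int qu_type;  } demo_que_info_t;

/* Same shape as pbs_db_obj_info_t: a type tag plus a union of pointers. */
typedef struct {
	int pbs_db_obj_type;
	union {
		demo_job_info_t *pbs_db_job;
		demo_que_info_t *pbs_db_que;
	} pbs_db_un;
} demo_obj_info_t;

/* Type-specific handlers that the generic entry point dispatches to. */
static int demo_save_job(const demo_job_info_t *job)
{
	printf("saving job %s\n", job->ji_jobid);
	return 0;
}

static int demo_save_que(const demo_que_info_t *que)
{
	printf("saving queue %s\n", que->qu_name);
	return 0;
}

/* Generic entry point: inspect the tag, then call the matching handler.
 * Returns 0 on success and -1 for a NULL object or an unknown tag. */
int demo_save_obj(const demo_obj_info_t *obj)
{
	if (obj == NULL)
		return -1;
	switch (obj->pbs_db_obj_type) {
	case DEMO_DB_JOB:
		return demo_save_job(obj->pbs_db_un.pbs_db_job);
	case DEMO_DB_QUEUE:
		return demo_save_que(obj->pbs_db_un.pbs_db_que);
	default:
		return -1;
	}
}
```

A caller fills in the tag and the matching union member, then passes the wrapper to the generic function; adding a new object type means adding one tag, one union member and one case, without changing the public signature.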

pbs_db_query_options_t: Structure used to pass database query options to database functions. The flags field can be used to pass any flags to a query function. The timestamp field can be used to pass a timestamp so that only rows modified more recently than that timestamp are returned (that is, rows that have been modified since a point in time).

...

Code Block
struct pbs_db_query_options {
  int	flags; /* Flags field can be used to pass any flags to a query function. */
  time_t timestamp; /* Timestamp field can be used to pass a timestamp, to return rows that have a modification timestamp newer (more recent) than the timestamp passed. (Basically to return rows that have been modified since a point of time) */
};
typedef struct pbs_db_query_options pbs_db_query_options_t;


...


...


query_cb_t: Function pointer for the callback function that processes the data returned by the database. For each row/record in the search result, the function pointed to by this pointer is called and the result is processed so that PBS can use it.

typedef int (*query_cb_t) (pbs_db_obj_info_t *, int *);
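The interplay between the query options and the callback can be sketched as follows. This is an illustration under stated assumptions, not the libdb implementation: demo_search is a hypothetical stand-in that walks an in-memory array instead of a database, the demo_ types are reduced versions of the structures above, and treating the int * argument of the callback as a running row counter is a guess made for this example.

```c
#include <stddef.h>
#include <time.h>

/* Reduced stand-in for pbs_db_obj_info_t: one row with a modification time. */
typedef struct {
	const char *name;
	time_t modified;
} demo_obj_info_t;

/* Mirrors pbs_db_query_options_t as described above. */
typedef struct {
	int flags;
	time_t timestamp;
} demo_query_options_t;

/* Same shape as query_cb_t: called once per returned row. */
typedef int (*demo_query_cb_t)(demo_obj_info_t *, int *);

/* Hypothetical search over an in-memory table. Rows whose modification
 * time is not newer than opts->timestamp are skipped; a NULL opts means
 * "no filter". Returns the number of rows handed to the callback, or -1
 * if the callback reports a failure. */
int demo_search(demo_obj_info_t *rows, size_t nrows,
		const demo_query_options_t *opts, demo_query_cb_t cb)
{
	int count = 0;
	size_t i;

	for (i = 0; i < nrows; i++) {
		if (opts != NULL && rows[i].modified <= opts->timestamp)
			continue;
		if (cb(&rows[i], &count) != 0)
			return -1;
	}
	return count;
}

/* Example callback: counts each row it is given. */
int demo_count_row(demo_obj_info_t *row, int *count)
{
	(void)row;
	(*count)++;
	return 0;
}
```

Passing a timestamp in the options lets the caller ask only for rows changed since its last query, while the callback keeps the row-to-object conversion on the PBS side.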


pbs_db_attr_list_t: Structure used to map database attr structure to C.


Code Block
struct pbs_db_attr_list {
	int attr_count;			     /* attribute count */
	pbs_list_head attrs;		     /* head of the attribute list */
};
typedef struct pbs_db_attr_list pbs_db_attr_list_t;

1.2. APIs from libdb.so

1.2.1. PBS Objects: Libdb APIs can work with the PBS objects listed below.

PBS_DB_JOB: To save and update PBS job objects

...

PBS_DB_MOMINFO_TIME: To save/update PBS mominfo_time

1.2.2. pbs_db_connect

Description: Set up a persistent database connection for further use by the APIs that work on PBS objects. On success, return the connection information.

Signature:

int pbs_db_connect(void *conn_db_handle, char *pbs_ds_host, int pbs_ds_port, int timeout)

Parameters:

conn_db_handle[out]: On successful connection, this pointer holds the connection information for the database.

pbs_ds_host[in]: Name of the host where the database is running.

...

timeout[in]: Timeout in seconds before the API will return if it is taking too long to connect.

Returns:

...

Error code

0: On successful connection setup to the database.

Non-zero: On connection failure.

1.2.3. pbs_db_disconnect

Description: Disconnect the PBS server from a database connection.

Signature:

int pbs_db_disconnect(void *conn_db_handle)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

Returns: Returns status of the disconnection.

0 - for successful database disconnect.

-1 - for failure
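The connect/disconnect life cycle can be sketched with stubs. The demo_ functions below are hypothetical and do not talk to any database; they only mimic the return conventions described above (0 on success, non-zero on connect failure, -1 on disconnect failure). Passing the handle as void ** so that the stub can hand a value back to the caller is an assumption of this sketch, and the port number in the usage note is purely illustrative.

```c
#include <stdlib.h>

/* Hypothetical state hidden behind the opaque connection handle. */
typedef struct {
	int connected;
} demo_conn_t;

/* Stub connect: fills *conn_db_handle on success.
 * Returns 0 on success and non-zero on failure. */
int demo_db_connect(void **conn_db_handle, const char *host, int port, int timeout)
{
	demo_conn_t *c;

	(void)timeout; /* a real implementation would bound the wait */
	if (conn_db_handle == NULL || host == NULL || port <= 0)
		return 1;
	c = malloc(sizeof(*c));
	if (c == NULL)
		return 1;
	c->connected = 1;
	*conn_db_handle = c;
	return 0;
}

/* Stub disconnect: releases the handle.
 * Returns 0 on success and -1 on failure. */
int demo_db_disconnect(void *conn_db_handle)
{
	if (conn_db_handle == NULL)
		return -1;
	free(conn_db_handle);
	return 0;
}

/* Caller pattern: connect once, do the work, then disconnect. */
int demo_session(const char *host, int port)
{
	void *handle = NULL;

	if (demo_db_connect(&handle, host, port, 5) != 0)
		return -1;
	/* ... save/load/search calls would use the handle here ... */
	return demo_db_disconnect(handle);
}
```

For example, demo_session("localhost", 15007) returns 0 with these stubs, while a NULL host makes the connect step fail and the session return -1 without attempting a disconnect.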

1.2.4. pbs_db_save_obj

Description: PBS can use this API to save any of the PBS Objects listed above to the database. Save operation can be an insert or update.

Signature:

int pbs_db_save_obj(void *conn_db_handle, pbs_db_obj_info_t *obj, int savetype)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

obj[in]: The PBS object to save. Based on the type defined within the obj structure, the appropriate PBS object will be picked and dealt with.

...

1 - Execution succeeded but the statement did not affect any rows.

1.2.5. pbs_db_delete_obj

Description: This API lets PBS delete PBS objects PBS_DB_JOB, PBS_DB_RESV, PBS_DB_NODE, PBS_DB_QUE and PBS_DB_SCHED from the database.

Signature:

int pbs_db_delete_obj(void *conn_db_handle, pbs_db_obj_info_t *obj)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

obj[in]: PBS object information to be deleted. Based on the type defined within the obj structure, the right PBS object will be picked and dealt with.

...

1 - Success but no rows deleted

1.2.6. pbs_db_load_obj

Description: This API lets PBS load object data from the database. This API can work with any of the PBS objects.

Signature:

int pbs_db_load_obj(void *conn_db_handle, pbs_db_obj_info_t *obj)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

obj[in/out]: PBS object information to be loaded from the database. Based on the type defined within the obj structure, the appropriate PBS object will be picked and dealt with.

...

0 - Success
1 - Success but no rows loaded
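Because "success" and "success but no rows loaded" are distinct return values, callers usually need a three-way decision rather than a simple zero/non-zero test. The helper below is a small sketch of that idea; the demo_ names are hypothetical, and treating every value other than 0 and 1 as a failure is an assumption of this sketch.

```c
/* Caller-facing outcomes of a load operation. */
typedef enum {
	DEMO_LOAD_OK,        /* object was found and loaded */
	DEMO_LOAD_NOT_FOUND, /* query ran, but no matching row exists */
	DEMO_LOAD_ERROR      /* the call itself failed */
} demo_load_result_t;

/* Map the integer return code onto the three outcomes.
 * 0 and 1 follow the list above; anything else is treated as failure. */
demo_load_result_t demo_classify_load_rc(int rc)
{
	switch (rc) {
	case 0:
		return DEMO_LOAD_OK;
	case 1:
		return DEMO_LOAD_NOT_FOUND;
	default:
		return DEMO_LOAD_ERROR;
	}
}
```

Keeping "not found" separate from "error" lets the server create a missing object instead of reporting a database failure.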

...

Description: This API can be used to find/search PBS objects PBS_DB_JOB, PBS_DB_RESV, PBS_DB_NODE, PBS_DB_QUE and PBS_DB_SCHED in the database. It takes a pointer to a callback function as an argument; the callback works on the records returned from the database for the query specified by PBS.

Signature:

int pbs_db_search(void *conn_db_handle, pbs_db_obj_info_t *obj, pbs_db_query_options_t *opts, query_cb_t query_cb)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

obj[in/out]: Information of the object to be found.

...

Returns: Error code

-1 - Failure

>0 - Success
0 - Success but no rows found

1.2.8. pbs_db_del_attr_obj

Description: This API can be used to delete attributes of a PBS object.

Signature:

int pbs_db_delete_attr_obj(void *conn_db_handle, pbs_db_obj_info_t *obj, void *obj_id, pbs_db_attr_list_t *db_attr_list)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

obj[in]: Information of the object whose attributes are to be deleted.

obj_id[in]: Object id for the delete attributes action.

db_attr_list[in]: List of attributes to be deleted from the database.

Returns: Error code

0 - Success

Non zero - On Failure

1.2.9. pbs_start_db

Description: This API can be used to start the database instance.

Signature:

int pbs_start_db(char *pbs_ds_host, int pbs_ds_port)

Returns: Status code

0 - Success

Non zero - On Failure

1.2.10. pbs_stop_db

Description: This API can be used to stop the database instance.

Signature:

int pbs_stop_db(char *pbs_ds_host, int pbs_ds_port)

Returns: Status code

0 - Success

Non zero - On Failure

1.2.11. pbs_status_db

Description: This API can be used to check if the database instance is running.

Signature:

int pbs_status_db(char *pbs_ds_host, int pbs_ds_port)

Returns: Status code

1 - Database is not running.

2 - Database is running on a different host.

0 - Database is running on the localhost.

-1 - On Failure
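A caller typically branches on these status values. The sketch below gives the four codes symbolic names and shows one possible policy, namely starting the database only when it is not running anywhere. The DEMO_ names and the policy are assumptions made for illustration; only the numeric values come from the list above.

```c
/* Symbolic names for the status values listed above (names are hypothetical). */
enum {
	DEMO_DB_STATUS_FAILED     = -1, /* status check itself failed */
	DEMO_DB_RUNNING_LOCAL     = 0,  /* running on the localhost */
	DEMO_DB_NOT_RUNNING       = 1,  /* not running */
	DEMO_DB_RUNNING_ELSEWHERE = 2   /* running on a different host */
};

/* Human-readable description of a status value. */
const char *demo_describe_db_status(int status)
{
	switch (status) {
	case DEMO_DB_RUNNING_LOCAL:
		return "running on the localhost";
	case DEMO_DB_NOT_RUNNING:
		return "not running";
	case DEMO_DB_RUNNING_ELSEWHERE:
		return "running on a different host";
	default:
		return "status check failed";
	}
}

/* One possible policy: start the database only if nobody is running it. */
int demo_should_start_db(int status)
{
	return status == DEMO_DB_NOT_RUNNING;
}
```

With this policy a status of 2 does not trigger a start, which avoids two hosts running the same data service at once.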

1.2.12. pbs_db_password

Description: This API can be used to change the database user and password for the database instance.

Signature:

int pbs_db_password(void *conn_db_handle, char *user_name, char *password, char *old_user)

Parameters:

conn_db_handle[in]: Connection handle to the database, created by pbs_db_connect.

user_name[in]: Database user name.

password[in]: New password for the database.

old_user[in]: Old database user name, used with the -C option to change the database user.

Returns: Error code

0 - Success

Non zero - On Failure

1.2.13. pbs_db_get_errmsg

Description: This API can be used to get the error messages from the database library. When any of the above-listed APIs fail, this API can be used to get the error message from libdb.

Signature:

void pbs_db_get_errmsg(int failcode, char **errmsg)

Parameters:

failcode[in]: Error code returned by one of the above APIs.

errmsg[out]: Returns the DB error message, if any.
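The out-parameter style of this API can be sketched with a stub. demo_get_errmsg is hypothetical and returns fixed strings; the design above does not say who owns the returned buffer, so having the callee allocate and the caller free it is an assumption of this sketch, as are the message texts.

```c
#include <stdlib.h>
#include <string.h>

/* Stub with the same shape as pbs_db_get_errmsg(int failcode, char **errmsg).
 * On return, *errmsg is NULL when there is nothing to report; otherwise it
 * points to a heap-allocated string that the caller must free (assumed). */
void demo_get_errmsg(int failcode, char **errmsg)
{
	const char *text;

	if (errmsg == NULL)
		return;
	*errmsg = NULL;
	if (failcode == 0)
		return; /* success: no message */

	text = (failcode == -1) ? "database operation failed"
				: "unknown database error";
	*errmsg = malloc(strlen(text) + 1);
	if (*errmsg != NULL)
		strcpy(*errmsg, text);
}
```

A caller would check the return code of a libdb call, ask for the message only on failure, log it, and then release the buffer.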