About RDFs and CDFs
The raw data files (RDFs) and custom data files (CDFs) are text files that you receive from Galileo once per day. These files contain low-level extracts of the prior day’s activity from Galileo's databases.
The files are securely delivered each day. With the raw data you can build your own data store to compare with your production database, build statements and reports, and provide your customers with long-term transaction histories.
The RDFs comprise a standard suite that is offered as a unit:
- Customer Master — A list of primary accounts, their balances, and customer information
- Account Card — A list of cards issued
- Authorized Transactions — Authorization activity
- Posted Transactions — Settlements, payments, adjustments, and fees
- Lookup — A lookup file for the other RDFs
See Standard RDF Reference for a list of the fields for each file.
Galileo now offers a suite of secondary RDFs that contain all possible fields for each of the standard RDFs. All new clients (beginning Feb 2023) will receive this secondary RDF suite, and existing clients can request to receive the secondary suite instead of the standard suite.
You can also request one or more CDFs:
- Chargebacks and Disputes — Enhanced chargeback and dispute data that is similar to the Dispute Tracker Log in gAnalytics.
- Overdraft — Enhanced overdraft data beyond what the Posted Transactions RDF shows
- Authorization Expiries — Unsettled authorizations that have expired from the authorization database
- Early ACH — ACH details as well as number of days early the transaction posted
- ACH Returns — Details on returned ACH transactions
- Incoming ACH — Details on incoming ACH transactions
- Settlement — Similar to the Settlements - Bin Summary report in gAnalytics, this data looks back one day for Visa, Mastercard and Discover, three days for Maestro/Cirrus, and two days for Pulse, Allpoint and Star. In contrast, the gAnalytics report looks back one day for all networks.
- Mastercard SAFE Report — The SAFE report that was sent to Mastercard over the last seven days
- Fleet Data — Data specific to fleet cards such as vehicle numbers and fuel purchases.
- Other data — Any specialty data sets that Galileo can set up for you.
See CDF Reference for a list of the fields for each file.
RDFs and CDFs are available in Production but not in CV.
Galileo has sample RDFs that you can see for example values and formats. CDF samples are not available. In addition, you can consult the card transaction scenarios to see examples of how different transaction types are represented in the RDFs.
Accessing the files
The RDFs and CDFs are generated early every morning and contain activity from the previous calendar day, which is from 00:00:00 (midnight) to 23:59:59 Galileo system time.
- Because many different files are generated every day, availability times are staggered throughout the morning. Consult with Galileo for the times when you can expect your files to be ready.
- On days when maintenance upgrades are performed, the availability time may be delayed.
- In general, CDFs are available a few hours after the RDFs.
Galileo uses Secure File Transfer Protocol (SFTP) to deliver RDFs and CDFs. You can arrange to retrieve the files from Galileo's SFTP server or Galileo can push the files to your SFTP server. Coordinate with Galileo to get an account name and password.
RDF and CDF filenames have this format. The date represents the day when the file was generated, so a file that is dated
20231010 contains data from October 9, 2023:
The files are PGP-encrypted using your public PGP key. The standard RDFs are fixed-width files, which means that each field takes a fixed amount of space, including trailing spaces to pad out the field. Secondary RDFs and CDFs are comma-separated value files (CSVs). Galileo will provide you with a document that specifies the layouts for your program, including field sizes and data types. You can also access the latest version of your RDF layouts from the gAnalytics (Galileo Analytics) tool. Only the standard RDF layouts are available in gAnalytics.
If there is no activity in your program during a calendar day, you will receive files that contain headers and footers but no records.
If there is a system or connectivity failure that prevents the files from being delivered at the usual time, the files will be delivered as soon as it is possible instead of waiting until the next day.
Some of the fields contain PCI-sensitive information. If you are PCI compliant, you have the option of requesting that these values be masked or replaced by spaces. If you are not PCI-compliant, these values will be replaced by spaces:
ID 2, when
ID TYPE 2is SSN (Social Security number)
DATE OF BIRTH
Building a datastore
Galileo strongly recommends that you use the RDFs to create a datastore of transactions and customer records that is separate from the production database that you create from the Events API or Program API. (For more information on building a production database see the Transaction History guide.)
Use the RDF-based datastore to:
- Ensure that there are no missing or incorrect transactions in your production database. Where there are discrepancies between the RDFs and your records, you should consider the RDFs to be authoritative.
- Run reports and perform data analysis, where large queries would overly tax your live production environment.
- Produce longer-term transaction histories in your app or on your website instead of using costly endpoint calls to Get Transaction History or other Program API endpoints.
Use the CDF-based datastore to add details to the transactions in the Posted Transactions RDF, such as overdraft, ACH and expired authorizations.
Because RDFs and CDFs are daily differential files instead of complete histories, you can read the files into your own data histories every day, which allows you to maintain a complete history by stitching together each day’s changes.
Keep in mind that when you are the system of record, the Galileo system may not have all of your transactions, depending on which transactions you report to Galileo. For example, Galileo may not have your card loads or other deposits.
Performing queries across tables
With each RDF in its own table you can perform complex queries. If you need to join two RDF tables in a query, join on these two fields:
UNIQUE PROGRAM ID
GALILEO ACCOUNT ID
When joining CDF-based tables, select fields that uniquely identify the account or transaction:
network_codeuniquely identifies card transactions
pmt_ref_nouniquely identifies an account
bal_iduniquely identifies balances, though you should keep in mind that multiple accounts can transact on the same balance
See Creating Statements from the RDFs for instructions on using your RDF datastore to build customized statements for your customers.
Consult RDF Reference, Secondary RDF Reference, or CDF Reference for individual field descriptions as well as other details about each file.
Updated 29 days ago