Domo
Store your event and dispatch data in Domo through S3 or Google Storage integration
Domo Data Warehouse Integration
The Domo Data Warehouse integration provides a powerful solution for storing and analyzing your event, dispatch, and visitor data. We utilize S3 or Google Cloud Storage as an intermediary storage layer, allowing you to easily import the data into Domo's cloud-based business intelligence platform.
How the Integration Works
- Daily Cloud Storage Deposits: Events and dispatches are automatically deposited into your S3 bucket or Google Cloud Storage bucket on a daily basis
- Visitor Updates: Visitor data contains records where the last seen timestamp is greater than or equal to yesterday, requiring upsert processing
- Complete Data: All events and dispatches from your account are included in the deposits
- Flexible Import: You can configure Domo to read directly from your S3 bucket or Google Cloud Storage bucket
- Real-time Analytics: Domo's platform enables fast querying and visualization of your data
Data Organization
The data in your cloud storage bucket is organized in a partitioned structure:
your-bucket/
├── events/
│ └── YYYY/
│ └── MM/
│ └── DD/
│ └── *.parquet
├── dispatches/
│ └── YYYY/
│ └── MM/
│ └── DD/
│ └── *.parquet
└── visitors/
└── YYYY/
└── MM/
└── DD/
└── *.parquet
This partitioning by year/month/day makes it easy to:
- Query specific time periods efficiently
- Manage data retention policies
- Process historical data in batches
- Use partition projections for optimized querying
Data Processing Considerations
Events and Dispatches
Events and dispatches are complete daily snapshots containing all data for that day. Each day's parquet files contain all events and dispatches that occurred on that specific date.
Visitors
Visitor data contains records that have been recently updated. This means you'll need to implement an upsert process to merge this incremental data into your Domo datasets:
- Read the parquet files from the visitors directory for the current day
- Identify existing records in your Domo dataset using visitor identifiers
- Update existing records with new information from the parquet files
- Insert new records for visitors that don't exist in your dataset
- Handle conflicts based on your business logic (e.g., latest timestamp wins)
This incremental approach ensures you have the most up-to-date visitor information while maintaining data consistency across your Domo analytics infrastructure.
Getting Started
To set up the Domo Data Warehouse integration:
- Contact your account manager to enable the integration and provide you with the required permissions for your cloud storage bucket
- Configure your S3 bucket or Google Cloud Storage bucket to receive the daily data deposits
- Set up Domo access to your cloud storage using one of the methods below
Once configured, your event, dispatch, and visitor data will be automatically deposited into your cloud storage bucket daily, ready for import into Domo. Remember to implement the appropriate upsert logic for visitor data to maintain data consistency in your target systems.
Setting Up Domo Access to Cloud Storage
Domo provides integration options for both S3 and Google Cloud Storage:
Amazon S3 Integration
Domo can connect to your S3 bucket to import data directly. For detailed setup instructions, refer to Domo's S3 documentation.
Google Cloud Storage Integration
Domo also supports Google Cloud Storage for data import. For detailed setup instructions, refer to Domo's Google Cloud Storage documentation.
Data Import Process
Once configured, Domo will automatically:
- Monitor your cloud storage bucket for new parquet files
- Import new data on a scheduled basis (typically daily)
- Process the data according to your configured data flows
- Update your datasets with the latest information
- Refresh your dashboards and reports automatically
Key Features
Domo's integration enables powerful data processing capabilities:
- Automated Data Pipelines: Set up scheduled data imports from your cloud storage
- Data Transformation: Transform and clean your data using Domo's data flow tools
- Real-time Dashboards: Create interactive dashboards and reports
- Data Governance: Implement data quality checks and monitoring
- Collaboration: Share insights and reports with your team
- Mobile Access: Access your analytics on any device
Best Practices
- Ensure your cloud storage bucket has appropriate access policies (you will need to contact a member of the Ours Privacy team for this)
- Consider setting up lifecycle policies to manage data retention
- Use Domo's data flow tools to transform and clean your data
- Take advantage of the partitioning structure for efficient data loading
- Monitor your data import schedules and data quality
- Set up alerts for data pipeline failures or anomalies
Updated about 22 hours ago