Data warehousing
Data warehousing captures data from a variety of sources so it can be accessed and analyzed by business analysts, data scientists and other end users. One goal is to enhance data quality and consistency for analytics uses while improving business intelligence. Read how data warehousing provides these and other unique benefits to overall data management strategy.
Top Stories
-
News
04 Apr 2024
Aerospike raises $114M to fuel database innovation for GenAI
The vendor will use the funding to develop added vector search and storage capabilities as well as graph technology, both of which can be used to train generative AI models. Continue Reading
-
News
01 Apr 2024
Vector search and storage key to AWS' database strategy
The tech giant is prioritizing vector search and storage, adding the capabilities to its data storage tools so customers can use them with language models to build AI applications. Continue Reading
-
Definition
21 Mar 2024
big data
Big data is a combination of structured, semi-structured and unstructured data that organizations collect, analyze and mine for information and insights. Continue Reading
-
News
19 Mar 2024
Confluent adds support for Apache Flink, unveils update
The streaming specialist has added a managed service for the popular open source compute layer so customers can use the tools of their choice to develop a data ecosystem. Continue Reading
-
News
19 Mar 2024
Redpanda serverless streaming option targets cost control
The real-time data management specialist's new serverless platform enables customers to pay only for the compute power they use, which helps predict and manage spending. Continue Reading
-
Definition
19 Mar 2024
off-site backup
Off-site backup is a method of backing up data to a remote server or to media that's transported off-site. Continue Reading
-
Tip
18 Mar 2024
On-premises vs. cloud data warehouses: Pros and cons
Data warehouses increasingly are being deployed in the cloud. But both on-premises and cloud data warehouses have pluses and minuses to consider, as explained here. Continue Reading
-
News
14 Mar 2024
Databricks partners with Mistral AI to aid GenAI development
The data cloud vendor joins Microsoft and Snowflake in partnering with -- and investing in -- the startup to provide customers with access to Mistral's open source language models. Continue Reading
-
News
12 Mar 2024
Snowflake boosting its commitment to AI, including GenAI
Recent moves, including the appointment of a new CEO and the formation of a new partnership, are representative of the vendor's heightened focus on artificial intelligence. Continue Reading
-
News
11 Mar 2024
Ocient raises $49.4M in funding to fuel development, growth
The data platform vendor plans to use the new financing for product development and to fund sales and marketing efforts as it moves into its growth phase. Continue Reading
-
News
29 Feb 2024
Google Cloud unveils new GenAI-fueled data, analytics tools
The tech giant introduced extensive support for vector search and improved access to unstructured data while also making a pair of GenAI capabilities generally available. Continue Reading
-
News
29 Feb 2024
Snowflake CEO Slootman steps down, Ramaswamy takes over
Slootman resigns after five years at the helm of the data cloud vendor. Revenues grew fivefold under him and the company went public in a record-setting initial public offering. Continue Reading
-
News
28 Feb 2024
Alation broadens ecosystem with spate of new integrations
The data catalog specialist is adding integrations with Amazon DynamoDB, MongoDB and Apache Kafka -- among others -- to broaden the connectivity of its data ecosystem. Continue Reading
-
Definition
26 Feb 2024
data warehouse as a service (DWaaS)
Data warehouse as a service (DWaaS) is an outsourcing model in which a cloud service provider configures and manages the hardware and software resources a data warehouse requires, and the customer provides the data and pays for the managed service. Continue Reading
-
News
26 Feb 2024
Exasol gets jolt of AI with Espresso suite of capabilities
The analytics database vendor's trio of new AI capabilities, including the ability to pull in GenAI models, aims to improve the efficiency of decision-making. Continue Reading
-
News
20 Feb 2024
Generative AI dominates Google's data and analytics roadmap
Following recent integrations between Gemini and the tech giant's major data and analytics platforms, more product innovations featuring LLMs are expected over the next few months. Continue Reading
-
News
14 Feb 2024
Alteryx, Databricks expand complementary partnership
The expanded partnership features new integrations designed to better enable joint customers to combine self-service data preparation and analysis with data science and AI. Continue Reading
-
Feature
26 Jan 2024
Rise of generative AI heightens need for data quality
Good information has always been key to fueling good decisions, but with LLMs prone to hallucinations, it's critical that the models be trained with intelligence that can be trusted. Continue Reading
-
Feature
23 Jan 2024
Databricks lakehouse a key tool for champion Texas Rangers
The Rangers relied on the data management vendor's tools to develop metrics that led to key personnel and strategic decisions that helped Texas win its first World Series. Continue Reading
-
News
19 Jan 2024
Qlik's Kyndi acquisition targets unstructured data, GenAI
The longtime analytics vendor's latest purchase adds support for unstructured data that can be combined with structured data to inform GenAI models and improve decision-making. Continue Reading
-
Definition
02 Jan 2024
big data engineer
A big data engineer is an information technology (IT) professional who is responsible for designing, building, testing and maintaining complex data processing systems that work with large data sets. Continue Reading
-
Feature
27 Dec 2023
Vector search now a critical component of GenAI development
The feature's ability to make unstructured data discoverable as well as locate similar data points among potentially billions makes it ideal for helping train generative AI models. Continue Reading
-
News
08 Dec 2023
Databricks intros suite to aid generative AI development
The data lakehouse vendor's new set of tools includes vector search and aims to enable users to develop data pipelines that train language models with proprietary data. Continue Reading
-
News
29 Nov 2023
New AWS tools simplify access, management of data at scale
The tech giant revealed serverless tools that eliminate limitations on workload size as well as integrations that simplify access to data within the Amazon Web Services ecosystem. Continue Reading
-
Feature
21 Nov 2023
American Airlines lowers data management costs with Intel
As the airline giant moves more of its data workloads to the cloud, tools from Intel's Granulate are making platforms such as Microsoft's Azure Data Lake more efficient. Continue Reading
-
News
09 Nov 2023
MongoDB unveils Atlas for Retail, Partner Ecosystem Catalog
The database vendor is making industry verticalization a significant part of its product roadmap to help customers in various industries develop applications that meet their needs. Continue Reading
-
News
25 Oct 2023
Neo4j updates graph database to improve performance speed
The vendor's latest update includes new parallel analytical query and transactional query runtime capabilities as well as automated real-time data tracking via change data capture. Continue Reading
-
News
04 Oct 2023
Databricks improves support for generative AI models
A new service enables users to easily deploy privately built language models and uses a GPU-based architecture to optimize and manage the models once on the lakehouse platform. Continue Reading
-
News
26 Sep 2023
MongoDB reveals new generative AI, vector search tools
After unveiling an integration with Google's LLM suite in June, the vendor moved a set of NLP tools into preview and introduced new data migration and vector search capabilities. Continue Reading
-
News
14 Sep 2023
Databricks lands $500M in new funding, now valued at $43B
The data lakehouse pioneer recently acquired MosaicML for $1.3 billion and could use the new capital to finance acquisitions and further investment in generative AI. Continue Reading
-
News
14 Sep 2023
Dremio launches updated SQL query acceleration capabilities
The data lakehouse specialist's new version of its SQL query acceleration tool, Reflections, includes automated recommendations and automatic data refresh capabilities. Continue Reading
-
News
06 Sep 2023
InfluxData launches new database for self-managed users
The time series database vendor completed the InfluxDB 3.0 product line with the release of InfluxDB Clustered, a version tailored for private cloud and on-premises deployments. Continue Reading
-
News
30 Aug 2023
Couchbase intros generative AI feature for its Capella DBaaS
The database vendor's new tool -- now in private preview -- uses LLM technology to make application developers more efficient by helping them more easily generate code. Continue Reading
-
News
28 Aug 2023
Google unveils generative AI integrations for data tools
Integrations between Duet AI and both Looker and BigQuery are among the features and integrations the tech giant is planning for its data and analytics tools. Continue Reading
-
Definition
08 Aug 2023
dimension
In data warehousing, a dimension is a collection of reference information that supports a measurable event, such as a customer transaction. Continue Reading
-
News
24 Jul 2023
Oracle targets speed with launch of MySQL HeatWave Lakehouse
The tech giant's new lakehouse enables users of its database management suite to combine structured and unstructured data to develop a more complete view of their operations. Continue Reading
-
News
19 Jul 2023
Lakehouse architecture the best fit for modern data needs
While data warehouses and data lakes each excel at handling certain types of data, a hybrid of the two is the best means of handling the increasing complexity of data management. Continue Reading
-
News
18 Jul 2023
Confluent partner plan aids streaming data platform delivery
The vendor's Connect With Confluent program enables technology partners to deliver event data to end users in real time through integrations with Confluent Cloud. Continue Reading
-
News
12 Jul 2023
Collins Aero reducing flight delays with Databricks platform
The data lakehouse vendor's tools form the foundation of analytics products designed to help airlines predict and prevent the maintenance problems that result in delays and cancellations. Continue Reading
-
News
11 Jul 2023
Teradata makes VantageCloud Lake available on Azure
By making its cloud-native platform natively available on Azure, the data management and analytics vendor aims to more smoothly enable users to run machine learning and BI tasks. Continue Reading
-
News
07 Jul 2023
Dremio names former Splunk executive new CEO
The former Splunk executive takes over as the data lakehouse vendor's leader, aiming to raise the company's profile to demonstrate its capabilities and compete for market share. Continue Reading
-
Feature
06 Jul 2023
Generative AI hype evolving into reality in data, analytics
Organizations are already beginning to apply the technology to their data operations, helping expand analytics use to more employees and boosting the efficiency of data experts. Continue Reading
-
News
28 Jun 2023
Databricks introduces Delta Lake 3.0 to help unify data
As part of the open source community developing the data storage platform, the vendor unveiled the platform's latest iteration with data unification the main goal of the update. Continue Reading
-
News
27 Jun 2023
Snowflake targets generative AI with new capabilities
The vendor unveiled new features -- including new containerization capabilities in Snowpark -- to create a secure environment for developers to build LLM-infused applications. Continue Reading
-
News
26 Jun 2023
Databricks acquiring MosaicML to add more generative AI
The data lakehouse vendor's purchase of the generative AI vendor will enable customers to build and train language models specific to their needs by using their own data. Continue Reading
-
News
22 Jun 2023
MongoDB unveils new AI, migration tools for database
The vendor, with its latest slate of new and updated capabilities, is adding generative AI with its partnership with Google Cloud and launching a new data migration tool. Continue Reading
-
News
20 Jun 2023
Starburst Galaxy update targets governance, data access
The vendor's latest update includes the public preview of Gravity, a centralized access and governance layer that enables users to better control and connect data across clouds. Continue Reading
-
News
16 Jun 2023
Dremio adds first generative AI-infused tool, intros others
The vendor's initial generative AI-infused tool is Text-to-SQL, which enables customers to work with data using natural language that automatically gets translated to code. Continue Reading
-
News
07 Jun 2023
Ascend.io, Databricks integration improves data visibility
The update includes support for Databricks' Unity Catalog to enable joint customers to better organize and view datasets that can be used to inform data science and BI projects. Continue Reading
-
News
06 Jun 2023
Snowflake launches Government & Education Data Cloud
The industry-specific platform is the vendor's seventh and includes data sets and other pre-built capabilities to meet the needs of government agencies and educational institutions. Continue Reading
-
News
06 Jun 2023
Collibra update targets data quality, lineage and discovery
The data management vendor's Data Intelligence Cloud now includes pushdowns that enable work within Snowflake and Databricks and prebuilt workflows focused on data visibility. Continue Reading
-
Feature
02 Jun 2023
Data mesh helping fuel Sloan Kettering's cancer research
The cancer hospital and research center began using tools from data management vendor Dremio two years ago to decentralize its data operations and improve speed-to-insight. Continue Reading
-
Definition
31 May 2023
data lakehouse
A data lakehouse is a data management architecture that combines the key features and the benefits of a data lake and a data warehouse. Continue Reading
-
News
25 May 2023
Snowflake acquisition of Neeva to add generative AI
The pending purchase will let the data cloud vendor infuse generative AI throughout its data management suite and potentially open the technology's use to a broader audience. Continue Reading
-
Feature
17 May 2023
Peloton rides, runs, rows with AWS for data management
The connected fitness company has long used AWS tools. When its data volume surged during COVID-19, Redshift was critical -- and still is as the company attempts a fiscal comeback. Continue Reading
-
Tip
15 May 2023
Mainframe databases teach an old dog new survival tricks
Long predicted to fade away in favor of more modern architectures, mainframes still play an integral role in corporate IT strategies, thanks to advances in database software. Continue Reading
-
News
24 Apr 2023
IBM acquires Ahana, steward of open source PrestoDB
The purchase gives IBM not only a managed SaaS and AWS marketplace version of the popular open source Presto database, but also membership in the Presto Foundation. Continue Reading
-
News
18 Apr 2023
Rockset adds vector embedding support to real-time database
The real-time database vendor now enables users to search and combine unstructured data with structured and semi-structured data to provide more in-depth modeling and analysis. Continue Reading
-
News
13 Apr 2023
Snowflake launches data cloud designed for manufacturers
The vendor's latest industry-specific platform -- its sixth -- is aimed at manufacturers and comes with best practices, a set of prebuilt applications and access to external data. Continue Reading
-
News
05 Apr 2023
Databricks launches new lakehouse for manufacturing
The platform for manufacturing companies comes with predictive maintenance and digital twin capabilities and is the vendor's fifth tool designed for a specific industry. Continue Reading
-
News
04 Apr 2023
Alation unveils enhanced partnerships with Databricks, DBT
The data catalog vendor launched new connectors with its partners designed to help joint customers better understand data in their lakehouses and more easily transform the data. Continue Reading
-
Opinion
16 Mar 2023
Data lakes: Key to the modern data management platform
Data lakes influence the modern data management platform at all levels. Organizations can gain faster insights, save costs, improve governance and boost self-service data access. Continue Reading
-
News
06 Mar 2023
New high-volume agent connectors highlight Fivetran update
The data ingestion specialist's latest platform update focuses on enabling users to ingest high volumes of data to fuel real-time analysis and adds a new private deployment option. Continue Reading
-
Definition
28 Feb 2023
data warehouse
A data warehouse is a repository of data from an organization's operational systems and other sources that supports analytics applications to help drive business decision-making. Continue Reading
-
News
22 Feb 2023
Snowflake launches industry-specific data cloud for telecom
Designed specifically for telecom companies, the tool comes with prepackaged data sets and capabilities to enable quick onboarding and efficient data management and analytics. Continue Reading
-
News
21 Feb 2023
Google's Wright names embedded BI top analytics trend
While AI has developed into an important aid for making decisions, infusing data into the workflows of business users in real time is the most significant movement in BI now. Continue Reading
-
News
10 Feb 2023
InfluxData raises $81M to advance time series capabilities
The vendor is the creator and lead sponsor of the open source InfluxDB database and plans to use the new funding to further product development as it aims for profitability. Continue Reading
-
News
30 Jan 2023
Expanded AtScale, Databricks integration adds functionality
The semantic layer platform vendor's tools are now listed on Databricks' Partner Connect, and existing customers can now connect to Databricks SQL and Unity Catalog. Continue Reading
-
Feature
25 Jan 2023
Data lake vs. data warehouse: Key differences explained
Data lakes and data warehouses are both commonly used in enterprises. Here are the main differences between them to help you decide which is best for your data needs. Continue Reading
-
News
20 Jan 2023
Snowflake acquires Mobilize.net tools to aid cloud migration
The vendor purchased SnowConvert, a set of tools designed to automate some of the onerous coding work required to migrate data out of on-premises databases to the cloud. Continue Reading
-
Definition
29 Dec 2022
DataOps
DataOps is an Agile approach to designing, implementing and maintaining a distributed data architecture that will support a wide range of open source tools and frameworks in production. Continue Reading
-
Definition
21 Dec 2022
data mesh
Data mesh is a decentralized data management architecture for analytics and data science. Continue Reading
-
Feature
20 Dec 2022
Data forecast for 2023: Time to extract more value
Expect more organizations to optimize data usage to drive decision intelligence and operations in 2023, as the new year will be one of economic challenges for many. Continue Reading
-
Feature
06 Dec 2022
AWS analytics tools help French utility go green
The cloud computing giant's suite enabled Engie SA to transition away from fossil fuels and now helps the French utility manage a network of renewable energy sources. Continue Reading
-
Feature
05 Dec 2022
What is a data warehouse analyst?
Data warehouse analysts help organizations manage the repositories of analytics data and use them effectively. Here's a look at the role and its responsibilities. Continue Reading
-
News
30 Nov 2022
AWS adds data quality, scalability services for cloud data
The cloud giant expanded its data portfolio with a series of features designed to help organizations more easily scale database and data warehouse deployments. Continue Reading
-
News
29 Nov 2022
AWS expands cloud data options with Amazon DataZone
The cloud computing giant at its AWS re:Invent 2022 conference introduced a series of new capabilities to help organizations better integrate and manage data across services. Continue Reading
-
News
28 Nov 2022
Alation Connected Sheets extends data intelligence platform
The data intelligence vendor's Connected Sheets lets spreadsheet users directly pull in data sets from a data catalog to improve data governance and visibility. Continue Reading
-
News
18 Nov 2022
TigerGraph Cloud update adds ML, data visualization tools
The latest from the graph database vendor includes a feature that enables users to build visuals without writing code and another that lets data scientists use familiar tools. Continue Reading
-
News
16 Nov 2022
Alteryx launches SaaS version of Designer in Analytics Cloud
The fully cloud-native version of Designer furthers the vendor's move toward the cloud, which began in early 2022 with the launch of its first cloud-based suite of tools. Continue Reading
-
News
07 Nov 2022
Snowflake data cloud adds Python, multi-cloud collaboration
The cloud data vendor released preview updates to its platform to accelerate data queries, better support multi-cloud operations and boost developer productivity. Continue Reading
-
News
11 Oct 2022
Google grows data cloud capabilities for data management
The tech giant brings open source Apache Iceberg table format support to its BigLake data lake as it extends BigQuery support for unstructured data and Apache Spark. Continue Reading
-
Feature
04 Oct 2022
How to design a data architecture for business success
To gain business value from data, enterprises need to get their data architecture right – and the right business leadership and culture are critical to that. Continue Reading
-
News
28 Sep 2022
Qlik unveils pair of new integrations with Databricks
One is designed to enable joint users to easily ingest data into lakehouses, while the other aims to enable potential users to experiment with the platforms on a trial basis. Continue Reading
-
Feature
23 Sep 2022
How Lufthansa is flying its data warehouse to the cloud
Moving from an on-premises data system to the cloud can be a complex operation. Lufthansa is looking to remove some of the complexity with virtualization. Continue Reading
-
News
22 Sep 2022
Google launches trio of new tools for its data cloud
The data management and analytics tools, including new data sharing and data lake platforms, are designed to let users access more data at lower expense. Continue Reading
-
News
15 Sep 2022
Google eases cloud database migration, improves Datastream
Google wants to make migrating to its new AlloyDB database easier for its users as well as provide new support for database migration from PostgreSQL. Continue Reading
-
Feature
15 Sep 2022
Ricoh modernizes its analytics with Qlik
The information management and digital services company is beginning to develop a data culture, and the BI vendor's platform has enabled new efficiencies. Continue Reading
-
News
08 Sep 2022
Snowflake, UiPath launch integration to automate data prep
The collaboration between the data cloud vendor and robotic process automation vendor will enable joint customers to automate data pipelines used to fuel business applications. Continue Reading
-
News
30 Aug 2022
Alation adds Snowflake service, updates data catalog
The vendor launched the Alation Cloud Service for Snowflake designed to enable Snowflake users to more easily use Alation's data intelligence capabilities. Continue Reading
-
News
29 Aug 2022
Teradata launches cloud-native platform, enhances BI suite
VantageCloud Lake is designed to enable wider use across organizations along with better cost control, while ClearScape Analytics adds more than 50 new capabilities. Continue Reading
-
News
24 Aug 2022
New SAS, SingleStore integration boosts speed, efficiency
The integration between the longtime data and analytics vendor and the upstart database vendor enables users to work with data in-database, which increases speed-to-insight. Continue Reading
-
Tip
23 Aug 2022
Cloud database comparison: AWS, Microsoft, Google and Oracle
Here's a look at the rival cloud database offerings from AWS, Google, Microsoft and Oracle based on their product breadth, migration capabilities and pricing models. Continue Reading
-
News
17 Aug 2022
Cloudera users get fully managed data lakehouse platform
The vendor is expanding its set of offerings with the launch of CDP One, a service initially available only on AWS that enables serverless deployment in the cloud. Continue Reading
-
News
11 Aug 2022
GridGain, Apache Ignite founder talks in-memory databases
Nikita Ivanov details the origin of his company and discusses the growing need organizations have for real-time database processing capabilities to complete modern transactions. Continue Reading
-
Feature
09 Aug 2022
A look at Presto, Trino SQL query engines
The co-creator of the open source project at Facebook reflects on 10 years of growth as he helps lead one of its resulting tools into the future. Continue Reading
-
News
23 Jun 2022
Starburst acquires Varada to accelerate data lake queries
After a year of partnering, the data lake query vendor decided to acquire fellow Trino SQL query engine supporter Varada to help boost query performance. Continue Reading
-
Definition
16 Jun 2022
Elastic Stack (ELK Stack)
The Elastic Stack is a group of open source products from Elastic designed to help users take data from any type of source and in any format, and search, analyze and visualize that data in real time. Continue Reading
-
News
14 Jun 2022
Yellowbrick 6 advances cloud data warehouse deployments
The data warehouse vendor is growing its hybrid data warehouse capabilities with version 6.0 of its namesake platform that is now enabled to run on AWS. Continue Reading
-
News
14 Jun 2022
Snowflake Data Cloud expands with Unistore Hybrid Tables
Snowflake adds more capabilities, including support for Apache Iceberg data lake tables and both transactional and analytics workloads, with Hybrid Tables. Continue Reading
-
News
09 Jun 2022
Datamart a new self-service developer tool in Power BI
The capability, now in preview, will enable Premium customers to build small to midsize relational databases dedicated to a single subject, without requiring code. Continue Reading
-
News
26 May 2022
Tech stock sell-off signals tough times for data vendors
In addition to lowering the values of publicly traded data, analytics and AI vendors, the stock market's decline is making it difficult for companies still in their funding phase. Continue Reading