big-data
16Pages

{{requestButtons}}

Catalog excerpts

big-data - 3

Executive Summary Today the term big data draws a lot of attention, but behind the hype there's a simple story. For decades, companies have been making business decisions based on transactional data stored in relational databases. Beyond that critical data, however, is a potential treasure trove of non-traditional, less structured data: weblogs, social media, email, sensors, and photographs that can be mined for useful information. Decreases in the cost of both storage and compute power have made it feasible to collect this data which would have been thrown away only a few years ago. As a...

Open the catalog to page 3
big-data - 4

Introduction With the recent introduction of Oracle Big Data Appliance and Oracle Big Data Connectors, Oracle is the first vendor to offer a complete and integrated solution to address the full spectrum of enterprise big data requirements. Oracle’s big data strategy is centered on the idea that you can evolve your current enterprise data architecture to incorporate big data and deliver business value. By evolving your current enterprise architecture, you can leverage the proven reliability, flexibility and performance of your Oracle systems to address your big data requirements. Defining...

Open the catalog to page 4
big-data - 5

Value. The economic value of different data varies significantly. Typically there is good information hidden amongst a larger body of non-traditional data; the challenge is identifying what is valuable and then transforming and extracting that data for analysis. To make the most of big data, enterprises must evolve their IT infrastructures to handle the rapid rate of delivery of extreme volumes of data, with varying data types, which can then be integrated with an organization’s other enterprise data to be analyzed. The Importance of Big Data When big data is distilled and analyzed in...

Open the catalog to page 5
big-data - 6

Building a Big Data Platform As with data warehousing, web stores or any IT platform, an infrastructure for big data has unique requirements. In considering all the components of a big data platform, it is important to remember that the end goal is to easily integrate your big data with your enterprise data to allow you to conduct deep analytics on the combined data set. Infrastructure Requirements The requirements in a big data infrastructure span data acquisition, data organization and data analysis. The acquisition phase is one of the major changes in infrastructure from the days before...

Open the catalog to page 6
big-data - 7

generating aggregated results on the same cluster. These aggregated results are then loaded into a Relational DBMS system. Since data is not always moved during the organization phase, the analysis may also be done in a distributed environment, where some data will stay where it was originally stored and be transparently accessed from a data warehouse. The infrastructure required for analyzing big data must be able to support deeper analytics such as statistical analysis and data mining, on a wider variety of data types stored in diverse systems; scale to extreme data volumes; deliver...

Open the catalog to page 7
big-data - 8

Distributed File Systems Key/Value Stores DBMS (OLTP) ACQUIRE Flexible Specialized Developercentric MapReduce Solutions ETL ORGANIZE Data Warehouse Trusted Secure Administered ANALYZE Figure 1 Divided solution spectrum Distributed file systems and transaction (key-value) stores are primarily used to capture data and are generally in line with the requirements discussed earlier in this paper. To interpret and distill information from the data in these solutions, a programming paradigm called MapReduce is used. MapReduce programs are custom written programs that run in parallel on the...

Open the catalog to page 8
big-data - 9

Oracle’s Big Data Solution Oracle is the first vendor to offer a complete and integrated solution to address the full spectrum of enterprise big data requirements. Oracle’s big data strategy is centered on the idea that you can evolve your current enterprise data architecture to incorporate big data and deliver business value, leveraging the proven reliability, flexibility and performance of your Oracle systems to address your big data requirements. Figure 2 Oracle’s Big Data Solutions Oracle is uniquely qualified to combine everything needed to meet the big data challenge – including...

Open the catalog to page 9
big-data - 10

Figure 3 High-level overview of software on Big Data Appliance Oracle Big Data Appliance includes a combination of open source software and specialized software developed by Oracle to address enterprise big data requirements. The Oracle Big Data Appliance integrated software2 includes: Full distribution of Cloudera’s Distribution including Apache Hadoop (CDH) Cloudera Manager to administer all aspects of Cloudera CDH Open source distribution of the statistical package R for analysis of unfiltered data on Oracle Big Data Appliance Oracle NoSQL Database Community Edition3 And Oracle...

Open the catalog to page 10
big-data - 11

comprehensive set of open source software components needed to use Hadoop. Cloudera Manager is an end-to-end management application for CDH. Cloudera Manager gives a clusterwide, real-time view of nodes and services running; provides a single, central place to enact configuration changes across the cluster; and incorporates a full range of reporting and diagnostic tools to help optimize cluster performance and utilization. Oracle Big Data Connectors Where Oracle Big Data Appliance makes it easy for organizations to acquire and organize new types of data, Oracle Big Data Connectors enables...

Open the catalog to page 11
big-data - 12

Enterprises that are already using a Hadoop solution, and don’t need an integrated offering like Oracle Big Data Appliance, can integrate data from HDFS using Big Data Connectors as a standalone software solution. Oracle R Connector for Hadoop is an R package that provides transparent access to Hadoop and to data stored in HDFS. R Connector for Hadoop provides users of the open-source statistical environment R with the ability to analyze data stored in HDFS, and to run R models at scale against large volumes of data leveraging MapReduce processing – without requiring R users to learn yet...

Open the catalog to page 12

All Oracle catalogs and technical brochures

  1. osb

    2 Pages

  2. MySQL

    4 Pages

  3. 036083

    4 Pages

  4. DB

    2 Pages

  5. erp-cloud

    7 Pages

  6. sparc-t4-4

    4 Pages

  7. solaris-zfs

    3 Pages

  8. solaris-10

    5 Pages

  9. berkeley-db

    2 Pages

  10. berkeley-db

    2 Pages

  11. exadata

    2 Pages

  12. 034782

    5 Pages

  13. dms-11g

    4 Pages

Archived catalogs