Home > Mechanisms > Storage Device

Storage Device

Storage devices provide the underlying data storage environment for persisting the datasets that are processed by Big Data solutions. A storage device can exist as a distributed file system or a database.

Distributed file systems can be used for persisting immutable data that is intended for streaming access or batch processing. Databases, such as NoSQL repositories, can be used for structured and unstructured storage and read/write data access, as shown in Figure 2. Note that distributed file systems and databases are both on-disk storage devices.

Storage Device: Figure 1 - Structured data is imported into a storage device (1) using a data transfer engine (2). Unstructured data is imported (3) using another type of data transfer engine (4).

Figure 1 - Structured data is imported into a storage device (1) using a data transfer engine (2). Unstructured data is imported (3) using another type of data transfer engine (4).