Blockchain Data Processing Technology Learning
Glossary
Term DefinitionBlockchain is a decentralized distributed database with the characteristics of transparency, immutability and trustworthiness.Decentralized Node is a node outside the blockchain system that connects to different data sources and provides data to blockchain nodes.Blockchain Node is a node in the blockchain system that verifies and stores data.Query Parameter is a keyword or identifier used to query data.Query Result Data is the original data obtained from various data sources.Target Reliable Data is data with higher credibility after conversion by the data reliability protocol.Data Reliability Protocol is a protocol used to verify data reliability, such as the directed acyclic graph (DAG) protocol.Smart Contract is a contract code stored on the blockchain that can be executed automatically.Multi-dimensional Computing is a comprehensive calculation of data from multiple data sources.Fabric Cluster is a blockchain node cluster based on the Hyperledger Fabric platform.ETH Cluster is a blockchain node cluster based on the Ethereum platform. ReadWriteSet data A data set that records data read and write operations in a Fabric cluster. Simplified Payment Verification (SPV) data A lightweight data structure used to verify transactions in the Ethereum platform. OSS Cluster A server cluster of the Operation Support System (OSS) that stores non-blockchain data. HDFS Cluster A server cluster of the Hadoop Distributed File System (HDFS) that stores non-blockchain data.
Short Answer Questions
Briefly describe the characteristics of blockchain technology.
How does a blockchain system ensure data reliability?
What is a decentralized node? What role does it play in data processing?
What is the role of a data reliability protocol? Please list two data reliability protocols.
Briefly describe the differences between Fabric clusters, ETH clusters, and OSS clusters.
In which types of blockchain platforms are ReadWriteSet data and SPV data stored?
Why is it necessary to convert query result data into target reliable data?
What role does smart contract play in blockchain data processing?
What is multidimensional computing? How is it different from traditional computing methods?
Briefly describe the advantages of blockchain technology in data processing.
Answers to short-answer questions
Blockchain technology is a decentralized distributed database technology with characteristics including decentralization, transparency, immutability and trustworthiness. Once data is written to the blockchain, it is difficult to tamper with, and all participants can view the data and transaction history.
The blockchain system ensures the reliability of data through cryptography, consensus mechanism and distributed storage. Each block contains the hash value of the previous block, and any tampering will cause the hash value to change, which will be rejected by the entire network.
A decentralized node is a node outside the blockchain system that is used to connect different data sources and provide data to blockchain nodes. It plays the role of data acquisition and conversion in the data processing process, converting data in different formats into a format that can be recognized and stored by the blockchain.
The data reliability protocol is used to verify data reliability and ensure the authenticity and integrity of the data. Two common data reliability protocols include the directed acyclic graph (DAG) protocol and the Merkle DAG protocol.
Fabric cluster is a permissioned blockchain based on Hyperledger Fabric platform, mainly used for enterprise-level applications; ETH cluster is a public chain or consortium chain based on Ethereum platform, with a wider range of applications; OSS cluster is a traditional centralized storage system, which does not belong to the scope of blockchain technology.
ReadWriteSet data is stored in the blockchain based on Hyperledger Fabric platform to record data read and write operations; SPV data is stored in Ethereum platform to verify transactions in a lightweight way.
Because the data format and reliability of different data sources may be different, in order to ensure the reliability and consistency of blockchain data, the query result data needs to be converted into target reliable data that conforms to a specific data reliability protocol.
Smart contracts are stored on the blockchain and can be automatically executed according to pre-set rules, such as data verification, multi-party collaboration, etc., thereby improving the efficiency and security of data processing.
Multidimensional computing refers to the comprehensive calculation of data from multiple data sources. It breaks through the limitation that traditional computing methods can only process a single data source, and can analyze and utilize data more comprehensively.
Blockchain technology has the advantages of decentralization, transparency, security, and traceability in data processing, which can improve data reliability, reduce data management costs, and promote data sharing and collaboration.