-
Notifications
You must be signed in to change notification settings - Fork 511
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HDDS-11947. [Docs] [User Guide] Impala Integration. #7584
base: master
Are you sure you want to change the base?
Conversation
Change-Id: Ic09028a519199274093186b611963ebb8bfb7879
c9689fd
to
382380d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @jojochuang for documenting this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding this @jojochuang. I think we should try to standardize the layout of the integration pages a little more. For example the way the proposed Impala and Hive pages mention supported protocols, replication types, bucket types, etc is different.
5fb5f29
to
f86262f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As an Impala committer, I think this looks good and appreciate having it added to the Ozone docs.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @jojochuang for updating the patch. Few formatting suggestions based on looking at the rendered version.
|
||
Starting with version 4.2.0, Apache Impala provides full support for querying data stored in Apache Ozone. To utilize this functionality, ensure that your Ozone version is 1.4.0 or later. | ||
|
||
# Supported Access Protocols |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Impala is rendered as h1, so I think these sections should be h2.
# Supported Access Protocols | |
## Supported Access Protocols |
|
||
Note: The o3fs protocol is **NOT** supported by Impala. | ||
|
||
# Supported Replication Types |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# Supported Replication Types | |
## Supported Replication Types |
* RATIS (Replication) | ||
* Erasure Coding | ||
|
||
# Querying Ozone Data with Impala |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# Querying Ozone Data with Impala | |
## Querying Ozone Data with Impala |
* Managed Tables | ||
* External Tables | ||
|
||
## Managed Tables |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
## Managed Tables | |
### Managed Tables |
* Note: Dropping an external table in Impala does not delete the associated data. | ||
|
||
|
||
# Using the S3A Protocol |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# Using the S3A Protocol | |
## Using the S3A Protocol |
If the Hive Warehouse Directory is located in Ozone, you can execute Impala queries without any changes, treating the Ozone file system like HDFS. For example: | ||
|
||
```sql | ||
CREATE DATABASE d1; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The CREATE EXTERNAL TABLE
example is not indented, so I think the others can be unindented as well.
CREATE DATABASE d1; | |
CREATE DATABASE d1; |
``` | ||
|
||
```sql | ||
CREATE TABLE t1 (x INT, s STRING); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CREATE TABLE t1 (x INT, s STRING); | |
CREATE TABLE t1 (x INT, s STRING); |
You can create managed databases, tables, or partitions at a specific Ozone path using the `LOCATION` clause. Example: | ||
|
||
```sql | ||
CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; | |
CREATE DATABASE d1 LOCATION 'ofs://ozone1/vol1/bucket1/d1.db'; |
``` | ||
|
||
```sql | ||
CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; | |
CREATE TABLE t1 LOCATION 'ofs://ozone1/vol1/bucket1/table1'; |
Co-authored-by: Doroszlai, Attila <[email protected]>
Change-Id: Ia3bac190ab03fc275cde4f72d5d6f8fbf73bd1fd
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates Wei-Chiu. Format LGTM and @MikaelSmith has provided a good endorsement that the content is correct.
One thing to consider before merging, should we leave this page as draft
(a front matter key indicating it will not be published to the final site) until we get more integration pages completed? I think the current layout makes it look like Impala is the only supported Ozone integration.
Ah nvm i know what's going on. Updating it. |
Change-Id: I3006e8dc7bf8b0f1a55f16ccc3da8555c1d231e9
With |
What changes were proposed in this pull request?
HDDS-11947. [Docs] [User Guide] Impala Integration.
Adding the doc to the old user doc site.
Please describe your PR in detail:
What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-11947
How was this patch tested?
./hadoop-ozone/dev-support/checks/docs.sh passed.
https://github.com/jojochuang/ozone/actions/runs/12364445561