Professional Data Engineer on Google Cloud Platform

Question 131

What are all of the BigQuery operations that Google charges for?
Storage, queries, and streaming inserts
Storage, queries, and loading data from a file
Storage, queries, and exporting data
Queries and streaming inserts




Answer is Storage, queries, and streaming inserts

BigQuery bills for storage, queries, and streaming inserts. Loading data from a file and exporting data are not charged when done within the same region.

Question 132

Which of these statements about BigQuery caching is true?
By default, a query's results are not cached.
BigQuery caches query results for 48 hours.
Query results are cached even if you specify a destination table.
There is no charge for a query that retrieves its results from cache.




Answer is There is no charge for a query that retrieves its results from cache.

A. By default, a query's results are not cached. (False)
B. BigQuery caches query results for 48 hours. (False - 24 hours)
C. Query results are cached even if you specify a destination table. (False)
When a destination table is specified in the job configuration, the Cloud Console, the bq command-line tool, or the API, the query results are not cached.
https://cloud.google.com/bigquery/docs/cached-results#cache-exceptions

D. There is no charge for a query that retrieves its results from cache. (True)

Reference:
https://cloud.google.com/bigquery/docs/cached-results#pricing_and_quotas

Question 133

Which of these sources can you not load data into BigQuery from?
File upload
Google Drive
Google Cloud Storage
Google Cloud SQL




Answer is Google Cloud SQL

Cloud SQL is the only option listed that cannot be used as a direct load source.
In the console, the available sources are file upload, Bigtable, Google Drive, and Google Cloud Storage.

Question 134

How would you query specific partitions in a BigQuery table?
Use the DAY column in the WHERE clause
Use the EXTRACT(DAY) clause
Use the __PARTITIONTIME pseudo-column in the WHERE clause
Use DATE BETWEEN in the WHERE clause




Answer is Use the __PARTITIONTIME pseudo-column in the WHERE clause

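In an ingestion-time partitioned table, the partition is exposed as the _PARTITIONTIME pseudo-column (written with a single leading underscore). Filtering on it in the WHERE clause limits the query to the matching partitions.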

Reference:
https://cloud.google.com/bigquery/docs/partitioned-tables#ingestion_time
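As a minimal sketch of the filter described above, the following Python helper builds a query string that restricts a scan to a range of daily partitions. The table name and the helper name partition_filter_query are hypothetical, and the sketch only assembles SQL text; it does not call BigQuery.

```python
from datetime import date


def partition_filter_query(table, start, end):
    """Build SQL that reads partitions from start (inclusive) to end (exclusive)."""
    return (
        f"SELECT *\n"
        f"FROM `{table}`\n"
        f"WHERE _PARTITIONTIME >= TIMESTAMP('{start.isoformat()}')\n"
        f"  AND _PARTITIONTIME < TIMESTAMP('{end.isoformat()}')"
    )


# Hypothetical table name, used for illustration only.
sql = partition_filter_query(
    "my_project.my_dataset.events", date(2024, 1, 1), date(2024, 1, 8)
)
print(sql)
```

The half-open range (>= start, < end) avoids counting the boundary day twice when consecutive ranges are queried.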

Question 135

Which SQL keyword can be used to reduce the number of columns processed by BigQuery?
BETWEEN
WHERE
SELECT
LIMIT




Answer is SELECT

SELECT lets you name only the columns you need. Because BigQuery stores data by column, listing fewer columns reduces the amount of data processed, which lowers query time and cost.
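The effect can be illustrated with a toy model of columnar storage. The column names and byte sizes below are invented for the example, and the function is a simplification rather than BigQuery's actual billing logic: it just sums the sizes of the columns a query names.

```python
# Hypothetical per-column storage sizes, in bytes, for one table.
COLUMN_BYTES = {
    "user_id": 8_000_000,
    "event_time": 8_000_000,
    "payload": 400_000_000,
    "country": 2_000_000,
}


def bytes_processed(selected_columns):
    """Sum the sizes of the selected columns; ["*"] means every column."""
    if selected_columns == ["*"]:
        selected_columns = list(COLUMN_BYTES)
    return sum(COLUMN_BYTES[name] for name in selected_columns)


all_columns = bytes_processed(["*"])
two_columns = bytes_processed(["user_id", "country"])
print(all_columns, two_columns)
```

In this model, a WHERE or LIMIT clause would not change the totals, because every selected column is still read in full.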

Question 136

To give a user read permission for only the first three columns of a table, which access control method would you use?
Primitive role
Predefined role
Authorized view
It's not possible to give access to only the first three columns of a table.




Answer is Authorized view

Authorized views are used to provide restricted access to a table, for example by exposing only selected columns.

Reference:
https://cloud.google.com/iam/docs/understanding-roles
https://cloud.google.com/bigquery/docs/authorized-views
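As a rough sketch, the view behind such a setup could be defined with a statement like the one this Python helper generates. The dataset, table, view, and column names are all hypothetical. Creating the view is only the first step: the view must also be authorized on the source dataset and the user granted access to the view's dataset, which is not shown here.

```python
def restricted_view_sql(view, source_table, columns):
    """Build a CREATE VIEW statement that exposes only the given columns."""
    column_list = ", ".join(columns)
    return (
        f"CREATE VIEW `{view}` AS\n"
        f"SELECT {column_list}\n"
        f"FROM `{source_table}`"
    )


# Hypothetical names; the view lives in a separate dataset from the source table.
ddl = restricted_view_sql(
    "my_project.shared_views.customers_public",
    "my_project.private_data.customers",
    ["customer_id", "signup_date", "country"],
)
print(ddl)
```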

Question 137

What are two methods that can be used to denormalize tables in BigQuery?
1) Split table into multiple tables; 2) Use a partitioned table
1) Join tables into one table; 2) Use nested repeated fields
1) Use a partitioned table; 2) Join tables into one table
1) Use nested repeated fields; 2) Use a partitioned table




Answer is 1) Join tables into one table; 2) Use nested repeated fields

Denormalization means joining tables into a single table, or using nested repeated fields, so that queries run faster.

Reference:
https://medium.com/@guillaumelbr13/how-to-denormalize-tables-in-bigquery-cd1677c0aeab
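The two approaches can be compared on a small in-memory example. This is a plain Python sketch with made-up customer and order records, not BigQuery code: join_flat repeats the customer fields on every order row, while nest_orders keeps one row per customer with the orders held in a repeated field.

```python
# Hypothetical normalized data: two separate "tables".
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 2, "name": "Lin"},
]
orders = [
    {"order_id": 10, "customer_id": 1, "total": 25.0},
    {"order_id": 11, "customer_id": 1, "total": 40.0},
    {"order_id": 12, "customer_id": 2, "total": 15.0},
]


def join_flat(customers, orders):
    """Method 1: join into one flat table, one row per order."""
    by_id = {c["customer_id"]: c for c in customers}
    return [{**by_id[o["customer_id"]], **o} for o in orders]


def nest_orders(customers, orders):
    """Method 2: one row per customer, with orders as a nested repeated field."""
    nested = []
    for c in customers:
        own = [
            {"order_id": o["order_id"], "total": o["total"]}
            for o in orders
            if o["customer_id"] == c["customer_id"]
        ]
        nested.append({**c, "orders": own})
    return nested


flat = join_flat(customers, orders)
nested = nest_orders(customers, orders)
print(len(flat), len(nested))
```

The flat form duplicates the customer name on each order row; the nested form avoids that duplication while still keeping related data in one record.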

Question 138

Which of these is not a supported method of putting data into a partitioned table?
If you have existing data in a separate file for each day, then create a partitioned table and upload each file into the appropriate partition.
Run a query to get the records for a specific day from an existing table and for the destination table, specify a partitioned table ending with the day in the format "$YYYYMMDD".
Create a partitioned table and stream new records to it every day.
Use ORDER BY to put a table's rows into chronological order and then change the table's type to "Partitioned".




Answer is Use ORDER BY to put a table's rows into chronological order and then change the table's type to "Partitioned".

Once a table has been created, you cannot change its type to partitioned. The other three options are supported ways of populating a partitioned table.
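The "$YYYYMMDD" suffix mentioned in the second option can be generated from a date, as in this small Python sketch. The table name and the helper name partition_decorator are hypothetical.

```python
from datetime import date


def partition_decorator(table, day):
    """Append the $YYYYMMDD suffix that addresses a single daily partition."""
    return f"{table}${day.strftime('%Y%m%d')}"


# Hypothetical destination table for one day's records.
target = partition_decorator("my_dataset.events", date(2024, 3, 5))
print(target)
```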

Question 139

Which of these numbers are adjusted by a neural network as it learns from a training dataset (select 2 answers)?
Weights
Biases
Continuous features
Input values




Answers are:
Weights
Biases


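Weights and biases are the parameters that training adjusts to reduce the model's error. Continuous features and input values are data fed to the network, not values it learns.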
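A single linear neuron trained by gradient descent shows this in miniature. The sketch below uses invented data generated from y = 2x + 1 and arbitrary choices for learning rate and epoch count; only the weight w and the bias b are updated, while the inputs stay fixed.

```python
# Hypothetical training data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]


def train(xs, ys, learning_rate=0.05, epochs=2000):
    """Fit y = w * x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0  # the only values that learning changes
    n = len(xs)
    for _ in range(epochs):
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = 2.0 / n * sum(e * x for e, x in zip(errors, xs))
        grad_b = 2.0 / n * sum(errors)
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


w, b = train(xs, ys)
print(round(w, 3), round(b, 3))
```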

Question 140

Which TensorFlow function can you use to configure a categorical column if you don't know all of the possible values for that column?
categorical_column_with_vocabulary_list
categorical_column_with_hash_bucket
categorical_column_with_unknown_values
sparse_column_with_keys




Answer is categorical_column_with_hash_bucket

A vocabulary list is used when the column's values are known in advance (each value is mapped to an integer ID, starting from 0). A hash bucket is used when you don't know the possible values.

Reference:
https://www.tensorflow.org/tutorials/structured_data/feature_columns
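The difference between the two encodings can be sketched in plain Python. This is an illustration of the idea only: it does not use TensorFlow, the SHA-256 hash is a stand-in and not the hash function TensorFlow uses, and returning -1 for an unknown value is an assumption of this sketch.

```python
import hashlib


def vocabulary_id(value, vocabulary):
    """Map a known value to its position in the list; -1 marks an unknown value."""
    return vocabulary.index(value) if value in vocabulary else -1


def hash_bucket_id(value, num_buckets):
    """Map any string to one of num_buckets IDs without a predefined list."""
    digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_buckets


colors = ["red", "green", "blue"]  # hypothetical known vocabulary
print(vocabulary_id("green", colors))   # position in the vocabulary
print(vocabulary_id("purple", colors))  # not in the vocabulary
print(hash_bucket_id("purple", 10))     # still gets a bucket
```

The trade-off with hashing is that two different values can land in the same bucket, so the bucket count is usually chosen with the expected number of distinct values in mind.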
