Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add configurable progress and session info indicators #751

Open
wants to merge 4 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,8 @@

## NEXT RELEASE

* Added customization options for startup information and progress indicator

## 0.19.1

### Bug Fixes
Expand All @@ -16,7 +18,7 @@
### Features

* Added one internal magic to enable retry of session creation. Thanks @edwardps
* New `%%pretty` magic for pretty printing a dataframe as an HTML table. Thanks @hegary
* New `%%pretty` magic for pretty printing a dataframe as an HTML table. Thanks @hegary
* Update Endpoint widget to shield passwords when entering them in the ipywidget. Thanks @J0rg3M3nd3z @jodom961

## 0.18.0
Expand Down Expand Up @@ -112,4 +114,3 @@
* Updated code to work with Livy 0.5 and later, where Python 3 support is not a different kind of session. Thanks to Gianmario Spacagna for contributing some of the code, and G-Research for sponsoring Itamar Turner-Trauring's time.
* Fixed `AttributeError` on `None`, thanks to Eric Dill.
* `recovering` session status won't cause a blow up anymore. Thanks to G-Research for sponsoring Itamar Turner-Trauring's time.

37 changes: 37 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -172,6 +172,43 @@ If you want any registered livy sessions to be cleaned up on exit regardless of
}
```

## Notebook customizations

There are several ways in which sparkmagic gives feedback via the Jupyter notebook display system. This section
describes customizations available to modify the default behavior for such interactions, where applicable.

### Startup info table

When a session starts up, sparkmagic displays a table of session information including application name and the links to
sparkUI and logs. The display may be overridden by specifying a custom class:

```json
{
"startup_info_display_class": "module.path.classname"
}
```

The class should be a subclass of [StartupInfoDisplay](sparkmagic/sparkmagic/utils/startupinfo.py). It will be passed
the ipython_display, the LivySession object and the current session id. It should implement the `write_msg(msg)` method
to write a line of status output (by default this writes text to the current cell output), and the `display_info()`
method to show session information (by default, this displays an HTML table).

### Statement progress indicator

By default, sparkmagic uses the FloatProgress ipython widget to display the progress of a statement in the cell
output. If this is not desired, override the class used to construct the progress indicator:

```json
{
"progress_indicator_class": "module.path.classname"
}
```

The class should be a subclass of [ProgressIndicator](sparkmagic/sparkmagic/utils/progress.py) and will be passed the
session object and statement_id as arguments to the constructor. The `update(value_in_pct)` method will be called on
progress and the `close()` method will be called when the statement completes.


### Conf overrides in code

In addition to the conf at `~/.sparkmagic/config.json`, sparkmagic conf can be overridden programmatically in a notebook.
Expand Down
17 changes: 10 additions & 7 deletions sparkmagic/example_config.json
Original file line number Diff line number Diff line change
Expand Up @@ -21,20 +21,20 @@
"logging_config": {
"version": 1,
"formatters": {
"magicsFormatter": {
"magicsFormatter": {
"format": "%(asctime)s\t%(levelname)s\t%(message)s",
"datefmt": ""
}
},
"handlers": {
"magicsHandler": {
"magicsHandler": {
"class": "hdijupyterutils.filehandler.MagicsFileHandler",
"formatter": "magicsFormatter",
"home_path": "~/.sparkmagic"
}
},
"loggers": {
"magicsLogger": {
"magicsLogger": {
"handlers": ["magicsHandler"],
"level": "DEBUG",
"propagate": 0
Expand All @@ -43,7 +43,7 @@
},
"authenticators": {
"Kerberos": "sparkmagic.auth.kerberos.Kerberos",
"None": "sparkmagic.auth.customauth.Authenticator",
"None": "sparkmagic.auth.customauth.Authenticator",
"Basic_Access": "sparkmagic.auth.basic.Basic"
},

Expand All @@ -63,15 +63,18 @@
"coerce_dataframe": true,
"max_results_sql": 2500,
"pyspark_dataframe_encoding": "utf-8",

"heartbeat_refresh_seconds": 30,
"livy_server_heartbeat_timeout_seconds": 0,
"heartbeat_retry_seconds": 10,

"server_extension_default_kernel_name": "pysparkkernel",
"custom_headers": {},

"retry_policy": "configurable",
"retry_seconds_to_sleep_list": [0.2, 0.5, 1, 3, 5],
"configurable_retry_policy_max_retries": 8
"configurable_retry_policy_max_retries": 8,

"progress_indicator_class": "sparkmagic.utils.progress.defaultProgressIndicator",
"startup_info_display_class": "sparkmagic.utils.startupinfo.defaultStartupInfoDisplay"
}
16 changes: 5 additions & 11 deletions sparkmagic/sparkmagic/livyclientlib/command.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@
import sparkmagic.utils.configuration as conf
from sparkmagic.utils.sparklogger import SparkLog
from sparkmagic.utils.sparkevents import SparkEvents
from sparkmagic.utils.utils import get_progress_indicator_class
from sparkmagic.utils.constants import MAGICS_LOGGER_NAME, FINAL_STATEMENT_STATUS, \
MIMETYPE_IMAGE_PNG, MIMETYPE_TEXT_HTML, MIMETYPE_TEXT_PLAIN, \
COMMAND_INTERRUPTED_MSG, COMMAND_CANCELLATION_FAILED_MSG
Expand All @@ -26,6 +27,7 @@ def __init__(self, code, spark_events=None):
if spark_events is None:
spark_events = SparkEvents()
self._spark_events = spark_events
self.progress_indicator_class = get_progress_indicator_class()

def __repr__(self):
return "Command({}, ...)".format(repr(self.code))
Expand Down Expand Up @@ -69,16 +71,8 @@ def execute(self, session):

def _get_statement_output(self, session, statement_id):
retries = 1
progress = FloatProgress(value=0.0,
min=0,
max=1.0,
step=0.01,
description='Progress:',
bar_style='info',
orientation='horizontal',
layout=Layout(width='50%', height='25px')
)
session.ipython_display.display(progress)

progress = self.progress_indicator_class(session, statement_id)

while True:
statement = session.http_client.get_statement(session.id, statement_id)
Expand All @@ -87,7 +81,7 @@ def _get_statement_output(self, session, statement_id):
self.logger.debug(u"Status of statement {} is {}.".format(statement_id, status))

if status not in FINAL_STATEMENT_STATUS:
progress.value = statement.get('progress', 0.0)
progress.update(statement.get('progress', 0.0))
session.sleep(retries)
retries += 1
else:
Expand Down
18 changes: 10 additions & 8 deletions sparkmagic/sparkmagic/livyclientlib/livysession.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@
import sparkmagic.utils.constants as constants
from sparkmagic.utils.sparklogger import SparkLog
from sparkmagic.utils.sparkevents import SparkEvents
from sparkmagic.utils.utils import get_sessions_info_html
from sparkmagic.utils.utils import get_startup_info_display_class
from .configurableretrypolicy import ConfigurableRetryPolicy
from .command import Command
from .exceptions import LivyClientTimeoutException, \
Expand Down Expand Up @@ -118,6 +118,8 @@ def __init__(self, http_client, properties, ipython_display,
self.id = session_id
self.session_info = u""

self.startup_info_display = get_startup_info_display_class()

self._heartbeat_thread = None
if session_id == -1:
self.status = constants.NOT_STARTED_SESSION_STATUS
Expand All @@ -139,7 +141,8 @@ def start(self):
self.id = r[u"id"]
self.status = str(r[u"state"])

self.ipython_display.writeln(u"Starting Spark application")
startup_info = self.startup_info_display(self.ipython_display, [self], self.id)
startup_info.write_msg(u"Starting Spark application")

# Start heartbeat thread to keep Livy interactive session alive.
self._start_heartbeat_thread()
Expand All @@ -151,24 +154,23 @@ def start(self):
raise LivyClientTimeoutException(u"Session {} did not start up in {} seconds."
.format(self.id, conf.livy_session_startup_timeout_seconds()))

html = get_sessions_info_html([self], self.id)
self.ipython_display.html(html)
startup_info.display_info()

command = Command("spark")
(success, out, mimetype) = command.execute(self)

if success:
self.ipython_display.writeln(u"SparkSession available as 'spark'.")
startup_info.write_msg(u"SparkSession available as 'spark'.")
self.sql_context_variable_name = "spark"
else:
command = Command("sqlContext")
(success, out, mimetype) = command.execute(self)
if success:
self.ipython_display.writeln(u"SparkContext available as 'sc'.")
startup_info.write_msg(u"SparkContext available as 'sc'.")
if ("hive" in out.lower()):
self.ipython_display.writeln(u"HiveContext available as 'sqlContext'.")
startup_info.write_msg(u"HiveContext available as 'sqlContext'.")
else:
self.ipython_display.writeln(u"SqlContext available as 'sqlContext'.")
startup_info.write_msg(u"SqlContext available as 'sqlContext'.")
self.sql_context_variable_name = "sqlContext"
else:
raise SqlContextNotFoundException(u"Neither SparkSession nor HiveContext/SqlContext is available.")
Expand Down
19 changes: 13 additions & 6 deletions sparkmagic/sparkmagic/utils/configuration.py
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@
d = {}
path = join_paths(HOME_PATH, CONFIG_FILE)


def override(config, value):
_override(d, path, config, value)

Expand Down Expand Up @@ -60,8 +60,8 @@ def authenticators():
u"None": u"sparkmagic.auth.customauth.Authenticator",
u"Basic_Access": u"sparkmagic.auth.basic.Basic"
}


# Configs

def get_session_properties(language):
Expand All @@ -78,8 +78,8 @@ def session_configs():
@_with_override
def kernel_python_credentials():
return {u'username': u'', u'base64_password': u'', u'url': u'http://localhost:8998', u'auth': NO_AUTH}


def base64_kernel_python_credentials():
return _credentials_override(kernel_python_credentials)

Expand All @@ -99,7 +99,7 @@ def kernel_scala_credentials():
return {u'username': u'', u'base64_password': u'', u'url': u'http://localhost:8998', u'auth': NO_AUTH}


def base64_kernel_scala_credentials():
def base64_kernel_scala_credentials():
return _credentials_override(kernel_scala_credentials)

@_with_override
Expand Down Expand Up @@ -271,6 +271,13 @@ def kerberos_auth_configuration():
"mutual_authentication": REQUIRED
}

@_with_override
def progress_indicator_class():
return 'sparkmagic.utils.progress.defaultProgressIndicator'

@_with_override
def startup_info_display_class():
return 'sparkmagic.utils.startupinfo.defaultStartupInfoDisplay'
amitschang marked this conversation as resolved.
Show resolved Hide resolved
amitschang marked this conversation as resolved.
Show resolved Hide resolved

def _credentials_override(f):
"""Provides special handling for credentials. It still calls _override().
Expand Down
32 changes: 32 additions & 0 deletions sparkmagic/sparkmagic/utils/progress.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
from ipywidgets.widgets import FloatProgress, Layout

class ProgressIndicator:
def __init__(self, session, statement_id):
pass

def update(self, value):
pass

def close(self):
pass

class defaultProgressIndicator(ProgressIndicator):
amitschang marked this conversation as resolved.
Show resolved Hide resolved
def __init__(self, session, statement_id):
self.session = session
self.statement_id = statement_id
self.progress = FloatProgress(value=0.0,
min=0,
max=1.0,
step=0.01,
description='Progress:',
bar_style='info',
orientation='horizontal',
layout=Layout(width='50%', height='25px')
)
self.session.ipython_display.display(self.progress)

def update(self, value):
self.progress.value = value

def close(self):
self.progress.close()
20 changes: 20 additions & 0 deletions sparkmagic/sparkmagic/utils/startupinfo.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
from sparkmagic.utils.utils import get_sessions_info_html

class StartupInfoDisplay:
def __init__(self, ipython_display, sessions_info, current_session_id):
self.ipython_display = ipython_display
self.sessions_info = sessions_info
self.current_session_id = current_session_id

def write_msg(self, msg):
pass

def display_info(self):
pass
amitschang marked this conversation as resolved.
Show resolved Hide resolved

class defaultStartupInfoDisplay(StartupInfoDisplay):
amitschang marked this conversation as resolved.
Show resolved Hide resolved
def write_msg(self, msg):
self.ipython_display.writeln(msg)

def display_info(self):
self.ipython_display.html(get_sessions_info_html(self.sessions_info, self.current_session_id))
Loading