Sentinel Collector SDK 2014 Updates: Collector Documentation

Sentinel Collector SDK 2014 Updates: Collector Documentation

This article is part of a series of articles diving into the new functionality present in the 2014 Preview Sentinel SDK. Today I will be covering the new documentation options and what they mean for those who have developed collectors previously, as well as all of us developing collectors now and in the future. As in other articles, it may be useful to dive into some of the history of the SDK to lead into where we are going in the future, and provide some of the rationale behind that map.

The Sentinel SDK has been available to the public for many years, and has been evolving during that time to simplify the setup, provide additional capabilities to developers, and generally improve the base/boilerplate code that powers all of the plugins within Sentinel based on customer feedback; the 2011.1 SDK beta a few years ago had a lot of big changes to make the SDK setup a simple process that anybody can do instead of ten pages of steps. The documentation for collector plugins has existed throughout this time, and the format of the documentation has always been a template file provided by default when the collector plugin was started in the SDK, with some fields being pulled in from other parts of the collector at build time. Perhaps an example will help explain this.

In the 2011.1, as well as the 2014 Preview, SDK when building a new collector plugin there is a release.odt file which is a stock file with generic information that applies to collectors in general. This is editable using a proper document editor (LibreOffice, OpenOffice, or anything else supporting the public OpenDocument format) to add specific information for the current collector plugin. There are also document variables contained within this file which are automatically filled-in whenever the document is built; for example:

@AUTHOR@ - Replaced with the plugin's author
@PLUGIN@ - Replaced with the name of the plugin.
@UUID@ - Replaced with the plugin's UUID


During the build process for a collector there used to be a prompt that would ask something along the lines of, "Do you want to build documentation?" If you chose 'yes', and if you had your system configured properly (LibreOffice or OpenOffice installed, and referenced within the SDK plugin) the build process would use the release.odt file, plus a bunch of settings from the collector plugin that was being developed, and merge everything into a nice single document on the fly, creating a PDF and embedding it where necessary. The result is that a Production build of the collector automatically had (as much as possible) up-to-date information on the collector itself; connection methods were in there and updated, names of plugins, authors, vendors, your customized description of the plugin updated five minutes go per the customer's request, etc. For more details on how things used to be, see the Plug-In Documentation page found at https://www.novell.com/developer/plugin-sdk/sdk_documentation.html of the SDK web-based documentation. Because this process called an external process and then did a lot of mangling of information in release.odt within the document, it was a bit slow compared to the rest of the build process. Also, the PDF that was built was stored along with the collector, but usually has not been the one available within the plugin, and therefore accessible to a Sentinel user within ESM who was given a different version of mostly-boilerplate information. Finally, PDF is a great format, but it requires a special client of some sort to read it, where other formats can be read by just about anything. This all leads up to changes in the latest SDK.

Today all of that is still present, but you do not need it anymore. If you want to use it, particularly if you are upgrading to using the 2014 Preview SDK with a collector plugin created from a previous version of the SDK which already has documentation that you are not yet ready to recreate, you can enable building of the old style of documentation by modifying ~/.netiq/pluginsdk/oo.properties as the build output states:

_build-externaldoc:

[echo] External PDF will not be created because the variable build.pdf
in ~/.netiq/pluginsdk/oo.properties has been set to no.

Looking in the file mentioned the option to enable this setting, or prompt for it, or specify an alternate path for soffice, is all pretty obvious:

soffice.path.value=/usr/bin/
prompt.for.oo=no
build.pdf=no
macro.already.installed=yes


Now that we know how to do the old way if needed, let's talk about the new way. My opinion of the drawbacks to be overcome with the old way were requirements for external software (LibreOffice/OpenOffice), slowness, inconsistent documentation within the plugin vs. what was distributed separately, and limitations around the formats used (PDF in particular). Keeping in mind the need to keep everything properly cross-platform, the only options for some sort of document template and merge system other than the LibreOffice/OpenOffice format would be something like HTML which can be rich in nature but is completely able to be hacked with standard markup tools (WYSIWYG editors, text editors, XSLT and XML tools, etc.). Removing this dependency also removes some slowness due to lighter-weight applications being loaded, and the removal of the PDF format. As long as the documentation that was built was placed both inside the plugin as well as alongside it in the build directory, all problems are solved. Tada, here we are today.

The new way, using HTML as the final format, should be accessible anywhere... mobile phone, dedicated monitoring system, laptop, tablet; really what cannot open and render HTML pretty well these days? There was a potential downside to HTML in my mind, and that was the traditional use of dozens of files to support a single final product. For example, look at how many HTTP request are made by your web browser the next time you hit a website, and you'll see that there are images, CSS files, JS files, etc. all being roped in to make the final display what the author expects. While CSS and JS are regularly embedded, what about images? It turns out that while images are being avoided generally (big, harder to render on smaller screens, overused and providing a disproportionately small amount of value compared to size and other downsides) it is possible to embed an image entirely in HTML, so this was good news for me. Since it is useful information generally, let's start there.

For a quick read on the topic outside of the Sentinel world, check out this link from StackOverflow: http://stackoverflow.com/questions/1207190/embedding-base64-images (Embedding Base64 Images) The technology is known as Data URIs, is described on Wikipedia with some quick details including browser compatibility http://en.wikipedia.org/wiki/Data_URI_scheme and solves our problem nicely. There are some good reasons to NOT use these in normal web-based content, but this is neither normal nor web-based, so we'll make an exception. Note that adding images or other heavy content to a page will make it load more-slowly, as always, no matter how it is roped in for use in the client.

The basic look of the <img/> tag changes from something like this:

<img src="path/to/image.png"/>

to this:
<img src="alt="Embedded Image" src=""/>


Note that the src field includes the string 'data:', a colon (:), a MIME type (image/png), semicolon (;), a format (base64), one more comma (,), and then the base64-encoded image (obviously truncated here for brevity). The end result when putting these images somewhere in the HTML file is you end up with long lines of base64-encoded data, but it makes something otherwise impossible able to be done pretty easily. As an example, pretend you have decided to include a screenshot of the application vendor's logo, or maybe your customer's/employer's logo, and you want that somewhere in the final documentation to provide something extra and personalized. The first step is to get the image, if not available already, into a format that is made for the web; Portable Network Graphics (PNG) format is a good option, as is Scalar Vector Graphics (SVG), or maybe Joint Photographic Experts Group (JPEG). Once the file is available the next step is to base64-encode it and determine the proper MIME designation (image/png, image/svg+xml, or image/jpeg). Base64-encoding is trivial most of the time, and can be done using something like the following on any old server you have:

base64 /path/to/image/file.png | tr -d '\n'


The line above uses the built-in base64 command, giving it a file to encode, and then strips out newlines in the resulting output so only one big long string of base64-encoded data remains. The reason for the newlines is mostly convention; it is common for encodings like base64 to be "folded" after so many characters (78 usually) to fit within the size of an old-fashioned terminal window nicely, and to avoid pain and suffering in today's powerful editors that will otherwise try to show you the millionth character in a megabyte-long string of data. Still, for our purposes with data URIs we want one big long string of stuff, so we send the encoded data through 'tr' telling it to delete newlines, and now we're ready.

To provide a workable example with a known file, let's take SUSE's logo from Wikipedia: http://upload.wikimedia.org/wikipedia/en/b/b5/Suse_logo_w-tag_color.png Pull down the file, encode it as shown above (or with another base64-encoding mechanism of your choosing) and then create the data URI that results:

curl -O http://upload.wikimedia.org/wikipedia/en/b/b5/Suse_logo_w-tag_color.png
base64 Suse_logo_w-tag_color.png | tr -d '\n'


The resulting data URI should look like this:

<img src="alt="SUSE Logo Embedded Image" src=""/>


For those interested in the details about encoding (such as base64-encoding) the original file size was 11,989 bytes, and the encoded size is now 15,988 bytes which is 1.3x the original size, as is normal with base64-encoding since the encoding standard uses four bytes to represent three of the original bytes, expanding the total size by 1/3 or 33%. This is probably another reason why doing this on the web generally is not advisable.

Moving past the specifics of the data URI we can get into how to build the actual documentation. Within the collector plugin in Eclipse there is a 2011.1/docs/html directory which is where the customized HTML files will go before being joined together to make one large document holding the final documentation. By default this directory has three files: 'Known Issues.html', 'Release Notes.html', and 'Sections.seq'. The first two are basic HTML files, mostly empty, which can be customized with appropriate, plugin-specific content per their titles. Examples for Release Notes could include assumptions made during development, or maybe a bunch of documentation on how to setup the application to send the correct data per the collector's requirements, or future-looking statements about what may come in the next version of the plugin, or how to potentially handle newer version of the application that may otherwise be unsupported. Known Issues, of course, mentions known issues with the plugin such as limits on types of data, or event rates, or other limitations found during testing. For a first release both of these may be blank other than to indicate this is the first release.

The really interesting part is that Sections.seq file, which has a single commented-out line within to give some guidance on its use:

~~~Existing Index Section Name,New Index Section Name(optional),New Index Section Indent(optional),HTML resource


While this is maybe something that makes sense to those familiar with the purpose of the file, in reality this is not enough, or meant to be enough, to get going with expanding the documentation. In fact, these three files have under 1 KiB of information combined, so obviously a lot more is happening that needs to be covered. As a starter, the 2014 Preview documentation has some information on (currently) page 96 in the section titled 'HTML Plug-Iin Documentation'. The documentation briefly discusses 'HTML Document Fragments', which is basically what we are seeing with the Release Notes and Known Issues files which can be edited via any HTML or text editor, depending on the user's preference and ability.

Just to show what happens by default, go ahead and modify the Release Notes and Known Issues files and add some searchable content to them keeping the HTML nature of the documents intact. Run a build from Eclipse by right-clicking on the collector's '2011.1' and choosing 'Create Development Build'. Assuming the build completes properly, somewhere near the end should be a line like the one bolded below:

    [echo] Individual HTML fragements substitution in skeleton doc complete
[echo] HTML Document generated successfully
[copy] Copying 1 file to /home/ab/code/sentinel-plugin-sdk/content/build/Collector/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test/plugin/docs
[move] Moving 1 file to /home/ab/code/sentinel-plugin-sdk/content/build/Collector/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test
[echo] HTML Document generated at : /home/ab/code/sentinel-plugin-sdk/content/build/Collector/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test.html
[echo] ABSoftware_ABProduct_2011.1r1-201412241602-internal-test.html should now be available in the build directory
[xmltask] Setting standalone

_package-plugin:

_prepackage-plugin:
[xmltask] Setting standalone
[zip] Building zip: /home/ab/code/sentinel-plugin-sdk/content/build/Collector/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test/ABSoftware_ABProduct_2011.1r1-201412241602-internal-test.clz.zip
BUILD SUCCESSFUL
Total time: 3 seconds


Notice a few things here, specifically that the build completed successfully and that the SDK informed you as much at the very end in capital letters; no ambiguity with the build-time success or failure at least. Also, notice that the HTML document is generated and the path to access it is printed out for easy access. Copying this path and dropping it into a regular web browser should show the standard documentation, which should now include your little edits.

The documentation generated by default creates a nice page with a table of contents and information on this particular plugin's connection methods and other properties. Like the ODT-based documentation, a lot of base information is present for any collector but it is intended that the developer add and customize information a bit, which is where the 'seq' files come in. The two sequence (seq) files tell the build process in what order to place the included HTML files, and what "indent' level, so that the document is structured as the author intends. The original Document.seq file looks like this, as taken from sentinel-plugin-sdk/current/sdk/2011.1/Collector/static/docs/html/Document.seq:

~~~Tabs(optional),Index Section Name,HTML resource
,Legal Notice,legal.html
,About This Guide,About This Guide.html
,Audience,Audience.html
,Contacting Sales Support,Contacting Sales Support.html
,Contacting the Online User Community,Contacting the Online User Community.html
,Additional Documentation,Additional Documentation.html
,Documentation Conventions,Documentation Conventions.html
,Collector Overview,Collector Overview.html
,Supported Products,Supported Products.xsl
,Connection Methods,Connection Methods.xsl
,Configuration,Configuration.html
,Observer Configuration,Observer Configuration.html
,Collector Configuration Options,Collector Configuration Options.html
,Predefined Collector Parameters,Predefined Collector Parameters.xsl
,Configuring Time Zone,Configuring Time Zone.html
,Upgrade Procedures,Upgrade Procedures.html
,Collector Customization,Generic Collector Customization.html
,Troubleshooting,Troubleshooting.html
,Revision History,Revision History.html
,Known Issues,Known Issues.html
,Release Notes,Release Notes.html


As you can see there are a lot of sections in place already, as each .html file is also present as part of the default SDK. In that same directory are the following files, just to provide a current list with dates and sizes:

-rw-r--r-- 1 ab users  442 Aug 12 13:00 Collector Configuration Options Head.html
-rw-r--r-- 1 ab users 462 Aug 12 13:00 Collector Configuration Options.html
-rw-r--r-- 1 ab users 175 Aug 12 13:00 Collector Overview.html
-rw-r--r-- 1 ab users 4483 Aug 12 13:00 Collector Pack.html
-rw-r--r-- 1 ab users 5772 Aug 12 13:00 Collector Parsing.html
-rw-r--r-- 1 ab users 1639 Aug 12 13:00 Configuration.html
-rw-r--r-- 1 ab users 2704 Aug 12 13:00 Configuring Time Zone.html
-rw-r--r-- 1 ab users 1276 Aug 12 13:00 Connection Methods.html
-rw-r--r-- 1 ab users 5024 Aug 12 13:00 Connection Methods.xsl
-rw-r--r-- 1 ab users 1061 Aug 12 13:00 Document.seq
-rw-r--r-- 1 ab users 1073 Aug 12 13:00 ESM Component Configuration.html
-rw-r--r-- 1 ab users 612 Aug 12 13:00 Exploit Detection Advisor.html
-rw-r--r-- 1 ab users 1966 Aug 12 13:00 Generic Collector Customization.html
-rw-r--r-- 1 ab users 458 Aug 12 13:00 Integration Testing.html
-rw-r--r-- 1 ab users 840 Aug 12 13:00 Observer Configuration.html
-rw-r--r-- 1 ab users 7396 Aug 12 13:00 Predefined Collector Parameters.xsl
-rw-r--r-- 1 ab users 114 Aug 12 13:00 Revision History.html
-rw-r--r-- 1 ab users 219 Aug 12 13:00 Sections.seq
-rw-r--r-- 1 ab users 1255 Aug 12 13:00 Supported Data Sources.html
-rw-r--r-- 1 ab users 1932 Aug 12 13:00 Supported Products.xsl
-rw-r--r-- 1 ab users 3410 Aug 12 13:00 Troubleshooting.html
-rw-r--r-- 1 ab users 5244 Aug 12 13:00 Upgrade Procedures.html


It may also be useful to note that some of these files do not end in .html, but use .xsl instead. As these are XSLT files (XML stylesheets) their intention is not to be used directly but instead to be executed against data in the dev environment created by the collector's developer. It is in this way that dynamic settings from the collector build process, such as application descriptions, connection methods, etc. are all pulled into the documentation at build time. Since an XSLT file needs some kind of input, note that the package.xml, as generated during the build process, is used directly for this input. As a result, to have something dynamically roped into the documentation it needs to exist in the package.xml file.

It is possible, likely even, that some of these default sections are not exactly what you want to have in your custom documentation for a collector. To provide options for this scenario NetIQ has provided the ability to override any of these files, including the Document.seq and Sections.seq files, by placing exactly-named replacement files in the collector's development directory structure. Currently the 2011.1 directory that shows up in Eclipse has several files and a few directories underneath, and the 'docs' directory is, appropriately, where we look for documentation details. Within there is an 'html' directory as mentioned above, and while there are only three files by default more can be added, including copying originals from the main SDK directories for modification on a per-plugin basis. This is particularly neat because it means you can take advantage of what comes by default from NetIQ which is likely applicable to collectors worldwide, but then modify different pieces to meet your own needs, even if that means appending to an existing section without modifying the index (via the Sections.seq file), adding new main sections or subsections (Sections.seq or Document.seq, depending on needs), or removing sections that are not wanted (Document.seq probably). A recent example I had was a request to document, with screenshots, all of the types of events for several custom collectors. Even though they were entirely customized from this SDK, I used a lot of the boilerplate files, customized a few to better meet my needs, and then added a few of my own with a lot of information that did not easily fit in the existing document. Because many of these changes were company, not plugin, specific, I was able to reuse those fragment documents in multiple collectors to further save time. Basically we see all of the benefits programmers have seen for decades in breaking up source files to relevant components, but in documentation.

The Sections.seq file looks like this by default:

~~~Existing Index Section Name,New Index Section Name(optional),New Index Section Indent(optional),HTML resource


Note that the first and last fields are required in this file. The middle fields are only required if you are creating a new subsection, and if you want that new subsection to be indented relative to the original section specified in the first field, respectively.

A few rules surrounding overrides are important to know:

  1. An override (the file overriding the original file) must exist in the plugin's documentation directory (2011.1/docs/html) as mentioned above. The build process needs to find it, and this is where it looks, so put it there. I have not tried using symlinks yet to see if I can put a common file somewhere of my own and then link to it from many plugins' 2011.1/docs/html directory, bu that's on my list of things to try. If symlinking does not work, hard-linking certainly should.


  2. The overriding file must have the exact same name as the overridden file. If off at all, keeping in mind we're dealing with computers and computers are case-sensitive by default, the override will not work. Similarly, if the Sections.seq file is used to add to an existing section, or add a subsection, the reference to an existing section must be exactly correct per the original section's name.


  3. There are some limitations around what you can do in a Sections.seq file, and other limitations in a Document.seq file. You could probably get away with just having a Document.seq file and defining everything within and modify everything everywhere, but it appears that NetIQ anticipates that most of the we will either override an HTML file completely or else we'll append to it with Sections.seq. Between the two files, there are a lot of great options, and of course it's best to redefine as little as possible to take advantage of SDK improvements in the future as seamlessly as possible.


  4. In the Document.seq file the first field is a Tabs field, which indicates how indented the defined section (the one on the current row) is from the root of the document. Only tab characters (0x09) matter and all other whitespace characters are ignored. Also note that a subsection can only ever be indented by one tab more than its parent section, so trying to go from something with zero tabs to something Indented by two tabs is not allowed without an intermediate section indented by one tab. This mostly impacts the created document index and links within the document, as the actual sections of the document are not somehow indented.


For the record, the following are the default locations for HTML files for those plugins which support them:

collectors - sdk/2011.1/Collector/static/docs/html
actions - sdk/2011.1/Action/static/docs/html
reports - sdk/2011.1/Report/static/docs/html
solutions -sdk/2011.1/Solution/static/docs/html


Working on formatting within document sections there are a few common things which may be desirable to know, such as inserting a Note to the reader which stands out as such. For this, the built-in stylesheets (CSS, not XSLT) have a class of 'note' which will properly format things to stand out as a note. Also, because we're dealing with Information Technology (IT), it is likely we'll want to paste code or something formatted like code in our documentation. Surprisingly enough, there is a 'code' class that can be applied to HTML elements just like the 'note' class, so random div/p/span tags can have content formatted appropriately with very little work on our part.

There is more to cover, but this was supposed to be a quick review so perhaps more for later. Some other ideas that have come up after working through this last use of the SDK include using the symlinks (or hardlinks) for vendor-specific things, to try to minimize the need for plugin-specific alterations by instead having one document that is referenced by all of the plugins-specific directories. Another idea may be to do some outside-the-SDK generation of data from source files, particularly when doing images that may update frequently. Getting screenshot for 100 types of events is painful, and then encoding them and placing them in the appropriate files for the build process to consume is not exactly an exercise in anything other than repetition, so creating a quick script to automate that would be worth doing in the future, especially if there is any chance of the events changing as they often do during development.

Happy Computing!

DISCLAIMER:

Some content on Community Tips & Information pages is not officially supported by Micro Focus. Please refer to our Terms of Use for more detail.
Top Contributors
Version history
Revision #:
1 of 1
Last update:
‎2015-01-15 23:29
Updated by:
 
The opinions expressed above are the personal opinions of the authors, not of Micro Focus. By using this site, you accept the Terms of Use and Rules of Participation. Certain versions of content ("Material") accessible here may contain branding from Hewlett-Packard Company (now HP Inc.) and Hewlett Packard Enterprise Company. As of September 1, 2017, the Material is now offered by Micro Focus, a separately owned and operated company. Any reference to the HP and Hewlett Packard Enterprise/HPE marks is historical in nature, and the HP and Hewlett Packard Enterprise/HPE marks are the property of their respective owners.