FOSSology Project fundamentals:
This list is in priority order. (Owner of the task is listed in parenthesis, responsible for task breakdown and estimate). We don't have a schedule but are hoping to have this out by April. Without a schedule, this is just a guess and a hope. If you want to know why this takes as long as it does, just ask and have some time or money available to contribute.
(Mark) New heuristic based license analyzer (based on small phrases and phrases relative to other phrases). We are calling this analyzer
nomos.
Preliminary results show nomos to be 20x faster than bsam.
Need a way to add licenses to nomos without recompiling
(Bob) Implement
Buckets. This allows one to create categories used in license reports. For example, you could define categories like “good licenses”, “bad licenses”, “commercial licenses”, “files with no license”, …
Buckets mockup
Notes from 2/25 team meeting: 3-4 weeks (start date 2/26, est completion date 3/24)
Finish cascading buckets - 1 day; code checked in 2/26
Talk with Glen to understand env in the exec part of buckets - 1 day
Do we use a simple rule engine or rewrite for FO? - 5-10 days?
-
Modifications to include buckets in existing UI screen? (could Vincent do this task?) - 3 days (depends on if OSRB want modifications)
-
First pass makes a note of all the names that have to be changed in f1 and nomos.
Second pass verifies that the nomos changes are acceptable to OSRB.
Third pass is changing names in nomos and f1.
(Vincent) Package agents
pkg meta data (add debian to complement or replace the existing
spec file agent), identify packages, bin/src pkgs, and stated licenses
-
-
-
Notes from 2/25 team meeting: 4 days (start date 2/26, est completion date 3/3)
need to detect if dpkg is not installed. If so, disable the pkgagent.
make sure it can't be run via command line or the UI. Print an informative message!
test: run fossjobs
test: try it on fedora installs, both with & without dpkg installed
(DONE)default agents per user (Mark) Done!
Admin UI changes done. Testing & debugging - should be done by 2/19
(Mark) Create tests to validate buckets against nomos - 5 days (start date 2/26, est completion date 3/5)
Test will compare file, license, bucket from g-nomos to the results of f-nomos (Mark will try the verbose switch and if that doesn't work he will talk with Paul W and/or Glen)
need to verify the result at the package level
-
(DONE)
Report files with no license. This should be part of standard license reports, not just in Buckets. A mockup of the new license browser UI is here
-
(Bob) License Browser color coding
(Bob) Search within any part of a file tree
(Bob) Search for packages
(Bob, mark?) Browse packages
-
(Bob) Display license differences on a per file basis between versions of any archive (rpm, tar, etc)
-
Notes from 2/25 team meeting: 3 days (start date 2/26, est completion date 3/3)
NOTE: weather related power outage on 2/26 affected productivity.
where did he put the training files? put them in $DATADIR
locates copyright statements,
URL's & email addresses but only copyright is currently reported in the UI
data is currently being placed in copyright_test. Need to change the name.
UI is complete; need to move from Adam's project directory to the main trunk
-
-
Notes from 2/25 team meeting:
test all new FO1.2 features listed above
need to run Adam's copyright code
Creative Commons test files
(Mary) Address
bugs targeted for this release.
This list is to keep track and summarize changes needed in the UI that are the results of changing to the nomos agent as our primary license agent for 1.2.
(Mark) agent_license_once_compare should be deprecated and replaced by a nomos version.
(Mark) in the browse menu/page the Schedule license analysis link should be changed to reschedule nomos not bsam.
(Bob) include a warning message in license groups page indicating it will be deprecated and replaced with buckets.
All the mockups can be found here. Or just click on the individual mockups below:
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1.2 Notes
-
-
(Bob) Add capability for reanalysis without breaking data persistence ie. do new analysis without removing previous analysis results. This can be used, to compare new and old analysis results, and to insure that report url's are persistent.
(Bob) New scheduler
Modular plugins and agents. Supports optional plugin installs.
Improved multihost configuration and installation
(Scott Lamons) Identify binaries and where they came from.
Modify code to support the db server on a separate system. (This has always been a design goal but, has not been implemented correctly OR tested.)
-
This list captures “everything else” that we would like to work on but do not have scheduled, planned, or owned yet. This is the kitchen sink, as in “everything but the kitchen sink.”
High priority - within the next two releases
(Mary/Bob) Improve postgres error checking and error reporting
-
-
Deprecate (bSAM) license analyzer and licterms. They will still be available but not supported. To facilitate this, they could be separated into their own package.
-
(Bob) New machine learning license analyzer (based on sentence clustering). Currently we are calling this
F1. If there are results from both this analysis and fo_nomos, the results will be combined for reporting.
-
-
-
-
-
-
-
-
-
-
Low priority - could wait for 2-3 releases (or more)
-
-
-
-
-
-
-
-
-
-
Archived Reports - simple text file dump,
PDF reports, eventually full web archive of all analysis reports
-
-
-
-
-
-
-
-
See
Archive for a list of completed tasks