Copyright, URL & author detection agent

Reports:

  • All the copyrights for Package A
  • All the files in package A that have copyrights for Mary Doe
  • With a historical repository, Find all the files that have had their copyrights changed
  • All files with no copyrights
  • All email addresses
  • All url's

Issues:

  • LICENSE and NOTICE files list multiple licenses and copyrights for different parts of a package. This isn't necessarily a problem, but it means that it would be difficult to programmatically extract what copyrights go with which part of the package. So granularity below the package level will be difficult.