Skip to content

004. May 22 to May 25

aradu12 edited this page Jul 4, 2018 · 1 revision

Planned tasks for this week

  • task 1: update existing data to include API, correct-usage and/or 'rule' ✔️
  • task 2: collect more data using the techniques we discussed (stemming,filtering,issues etc.) ✔️
  • task 3:

Progress

task 1

  • added relevant api and general rule to as many problems as possible; some were harder to generalize
  • added 'suggestion' coding practices to problems not specifically tied to an api
  • separated problems related to api use from ones with 'suggestions'

task 2

  • edited mining code to include stem words and filter commits to .java files
  • mined some more repos and put the data on github

Open Problems

Things we discussed/agreed on

  • filtering commit messages to those affecting .java files and using stemming
  • using issues to find misuses
  • separating project-specific misuses from project-independent ones
  • adding API and 'correct usage' info to data
  • adding a general 'rule' message to data
  • removed the data from the 'plaid' repo that was irrelevant

Next steps

  • mine more repos using filtering and linking to issues