
012. July 16 to July 20

aradu12 edited this page Jul 23, 2018 · 9 revisions

Planned tasks for this week

  • task 1: create scripts for data processing ✔️
  • task 2: figure out whether GumTree can give us qualified API names ✔️
  • task 3: continue working on a comparison summary of this paper and the others we've looked at this summer
  • task 4: finish fixing graphs ⌛️

Progress

task 1

  • wrote several scripts to get
    • number of problems by type, language, tag, stars, or API
    • code here
    • can write more as needed
    • update: added some input checking and made tag and range custom
  • run client code using python3 DataBoxClient.py
    • requires a YAML parser to be installed -- see the links at the top of the DataBox class in the folder
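
A minimal sketch of the kind of counting these scripts do, assuming each problem is loaded as a dictionary. The field names (`type`, `language`, `tags`, `stars`), the sample records, and the helpers `count_by` and `count_in_star_range` are hypothetical; the real DataBox data layout and method names are not shown in these notes.

```python
from collections import Counter

# Hypothetical records; in practice these would come from the parsed YAML files.
problems = [
    {"type": "api-related", "language": "java", "tags": ["io", "json"], "stars": 120},
    {"type": "general-practise", "language": "python", "tags": ["io"], "stars": 15},
    {"type": "api-related", "language": "java", "tags": [], "stars": 900},
]


def count_by(records, field):
    """Count records per value of `field`; list-valued fields count each element."""
    counts = Counter()
    for record in records:
        value = record.get(field)
        if value is None:
            continue  # skip records that lack the field
        if isinstance(value, list):
            counts.update(value)
        else:
            counts[value] += 1
    return counts


def count_in_star_range(records, low, high):
    """Count records whose `stars` value lies in the inclusive range [low, high]."""
    if low > high:
        raise ValueError("low must not exceed high")  # basic input checking
    return sum(1 for r in records if low <= r.get("stars", 0) <= high)


if __name__ == "__main__":
    print(count_by(problems, "type"))
    print(count_by(problems, "tags"))
    print(count_in_star_range(problems, 100, 1000))
```

Treating list-valued fields such as tags element by element keeps one helper usable for every grouping listed above.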

task 2

  • Solved by Dr. Nadi
  • mapping nodes via NodeFinder is simple, so we can create a script to automate finding API changes
  • wrote a script to get API changes between files, adapted from sand's file
    • code here
    • also see notes in the comments
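
A rough sketch of the comparison step only, assuming the qualified API names have already been extracted from the old and new versions of a file. It does not call GumTree or NodeFinder, and the function name `api_changes` and the sample names are hypothetical, not taken from the actual script.

```python
def api_changes(old_apis, new_apis):
    """Return the qualified API names added to and removed from a file.

    Both arguments are iterables of fully qualified names (strings);
    how they are extracted is outside the scope of this sketch.
    """
    old_set, new_set = set(old_apis), set(new_apis)
    return {
        "added": sorted(new_set - old_set),
        "removed": sorted(old_set - new_set),
    }


if __name__ == "__main__":
    before = ["java.util.List.add", "java.io.File.exists"]
    after = ["java.util.List.add", "java.nio.file.Files.exists"]
    print(api_changes(before, after))
```

A set difference ignores how often an API is called, so a call that is only moved or duplicated would not show up as a change here.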

task 3

task 4

  • fixed the zoom on the main graphs
  • currently creating graphs with outliers removed
  • also planning to swap .png files with code to make life easier
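
One possible way to drop outliers before plotting, sketched with the common 1.5 × IQR rule. These notes do not say which rule the graphs actually use, so the rule, the factor `k`, and the function name `remove_outliers` are all assumptions.

```python
import statistics


def remove_outliers(values, k=1.5):
    """Keep values within [Q1 - k*IQR, Q3 + k*IQR], preserving input order.

    Needs at least two values, since statistics.quantiles requires that.
    """
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


if __name__ == "__main__":
    stars = [10, 12, 11, 13, 12, 11, 500]
    print(remove_outliers(stars))
```

Generating the graphs from code like this, rather than storing .png files, means the filtering threshold can be changed and the figures regenerated in one step.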

Other

  • moved general practise problems to their own directory to make data more organized and easier to parse (i.e., we now have general-practise, api-related, project-specific as subdirectories of problems)

Open Problems

  • new issues opened this week:

Things we discussed/agreed on

  • exploring GumTree mapping -- done; see task 2
  • looking at well-engineered projects to expand data
  • adding custom range options to DataBox -- done
  • adding more methods to DataBox
  • presenting at PLSE

Next steps

  • expand data using well-engineered projects
  • finish fixing graphs
  • continue paper