Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix exclusive hardware control mode switching on controller failed activation #1522

Open
wants to merge 20 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
20 commits
Select commit Hold shift + click to select a range
960010d
added an actuator interface with exclusive interface handling
Apr 19, 2024
141887e
ran precommit
Apr 19, 2024
2850e98
renamed into test_actuator_exclusive_interfaces
Apr 19, 2024
567d5ff
formatting fix
Apr 19, 2024
2015820
added a test controller that fails at activation
Apr 19, 2024
bcd7bf8
forgot to run pre-commit
Apr 19, 2024
6f2b361
Fix issues in the setup of the test controller that fails on activate
saikishor Apr 29, 2024
3b10a7f
Added test to reproduce the failure of switching with exclusive inter…
saikishor Apr 29, 2024
925cc25
Throw an exception when requesting already claimed exclusive interface
saikishor Apr 29, 2024
deaaebf
add the perform switching logic to fix the issue with exclusive inter…
saikishor Apr 29, 2024
a50fd87
remove the scoping
saikishor May 2, 2024
e48bdfd
Collect all failed activated controllers and perform command switch a…
saikishor May 13, 2024
3ba484f
added some documentation about the determinism
saikishor May 13, 2024
4a09800
add naming changes from state to lifecycle_state
saikishor Sep 26, 2024
01beeea
remove copy pasta remnants from comments
bmagyar Jan 4, 2025
9f56e52
Fix pre-commit
saikishor Jan 6, 2025
5b1771d
Remove visibility macros from the test controller
saikishor Jan 6, 2025
381e277
Fix the switch return to ERROR due to recent change in return state o…
saikishor Jan 6, 2025
3fa67f3
Update controller_manager/doc/userdoc.rst
saikishor Jan 6, 2025
bc6ce45
Merge branch 'master' into fix/exclusive_hw_interface_switching
saikishor Jan 7, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
15 changes: 15 additions & 0 deletions controller_manager/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -182,13 +182,28 @@ if(BUILD_TESTING)
DESTINATION lib
)

add_library(test_controller_failed_activate SHARED
test/test_controller_failed_activate/test_controller_failed_activate.cpp
)
target_link_libraries(test_controller_failed_activate PUBLIC
controller_manager
)
target_compile_definitions(test_controller_failed_activate PRIVATE "CONTROLLER_MANAGER_BUILDING_DLL")
pluginlib_export_plugin_description_file(
controller_interface test/test_controller_failed_activate/test_controller_failed_activate.xml)
install(
TARGETS test_controller_failed_activate
DESTINATION lib
)

ament_add_gmock(test_release_interfaces
test/test_release_interfaces.cpp
APPEND_ENV AMENT_PREFIX_PATH=${ament_index_build_path}_$<CONFIG>
)
target_link_libraries(test_release_interfaces
controller_manager
test_controller_with_interfaces
test_controller_failed_activate
ros2_control_test_assets::ros2_control_test_assets
)

Expand Down
6 changes: 6 additions & 0 deletions controller_manager/doc/userdoc.rst
Original file line number Diff line number Diff line change
Expand Up @@ -383,3 +383,9 @@ Hardware and Controller Errors

If the hardware during it's ``read`` or ``write`` method returns ``return_type::ERROR``, the controller manager will stop all controllers that are using the hardware's command and state interfaces.
Likewise, if a controller returns ``return_type::ERROR`` from its ``update`` method, the controller manager will deactivate the respective controller. In future, the controller manager will try to start any fallback controllers if available.

Factors that affect Determinism
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
When run under the conditions determined in the above section, the determinism is assured up to the limitations of the hardware and the real-time kernel. However, there are some situations that can affect determinism:

* When a controller fails to activate, the controller_manager will call the methods ``prepare_command_mode_switch`` and ``perform_command_mode_switch`` to stop the started interfaces. These calls can cause jitter in the main control loop.
40 changes: 28 additions & 12 deletions controller_manager/src/controller_manager.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1727,6 +1727,7 @@ void ControllerManager::activate_controllers(
const std::vector<ControllerSpec> & rt_controller_list,
const std::vector<std::string> controllers_to_activate)
{
std::vector<std::string> failed_controllers_command_interfaces;
for (const auto & controller_name : controllers_to_activate)
{
auto found_it = std::find_if(
Expand Down Expand Up @@ -1835,35 +1836,38 @@ void ControllerManager::activate_controllers(
}
controller->assign_interfaces(std::move(command_loans), std::move(state_loans));

auto new_state = controller->get_lifecycle_state();
try
{
found_it->periodicity_statistics->Reset();
found_it->execution_time_statistics->Reset();
const auto new_state = controller->get_node()->activate();
if (new_state.id() != lifecycle_msgs::msg::State::PRIMARY_STATE_ACTIVE)
{
RCLCPP_ERROR(
get_logger(),
"After activation, controller '%s' is in state '%s' (%d), expected '%s' (%d).",
controller->get_node()->get_name(), new_state.label().c_str(), new_state.id(),
hardware_interface::lifecycle_state_names::ACTIVE,
lifecycle_msgs::msg::State::PRIMARY_STATE_ACTIVE);
}
new_state = controller->get_node()->activate();
}
catch (const std::exception & e)
{
RCLCPP_ERROR(
get_logger(), "Caught exception of type : %s while activating the controller '%s': %s",
typeid(e).name(), controller_name.c_str(), e.what());
controller->release_interfaces();
continue;
}
catch (...)
{
RCLCPP_ERROR(
get_logger(), "Caught unknown exception while activating the controller '%s'",
controller_name.c_str());
}
if (new_state.id() != lifecycle_msgs::msg::State::PRIMARY_STATE_ACTIVE)
{
RCLCPP_ERROR(
get_logger(),
"After activation, controller '%s' is in state '%s' (%d), expected '%s' (%d). Releasing "
"interfaces!",
controller->get_node()->get_name(), new_state.label().c_str(), new_state.id(),
hardware_interface::lifecycle_state_names::ACTIVE,
lifecycle_msgs::msg::State::PRIMARY_STATE_ACTIVE);
controller->release_interfaces();
failed_controllers_command_interfaces.insert(
failed_controllers_command_interfaces.end(), command_interface_names.begin(),
command_interface_names.end());
continue;
}

Expand All @@ -1875,6 +1879,18 @@ void ControllerManager::activate_controllers(
resource_manager_->make_controller_reference_interfaces_available(controller_name);
}
}
// Now prepare and perform the stop interface switching as this is needed for exclusive
// interfaces
if (
!failed_controllers_command_interfaces.empty() &&
(!resource_manager_->prepare_command_mode_switch({}, failed_controllers_command_interfaces) ||
!resource_manager_->perform_command_mode_switch({}, failed_controllers_command_interfaces)))
{
RCLCPP_ERROR(
get_logger(),
"Error switching back the interfaces in the hardware when the controller activation "
"failed.");
}
}

void ControllerManager::activate_controllers_asap(
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,66 @@
// Copyright 2021 Department of Engineering Cybernetics, NTNU.
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

#include "test_controller_failed_activate.hpp"

#include <memory>
#include <string>

#include "lifecycle_msgs/msg/transition.hpp"

namespace test_controller_failed_activate
{
TestControllerFailedActivate::TestControllerFailedActivate()
: controller_interface::ControllerInterface()
{
}

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn
TestControllerFailedActivate::on_init()
{
return rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn::SUCCESS;
}

controller_interface::return_type TestControllerFailedActivate::update(
const rclcpp::Time & /*time*/, const rclcpp::Duration & /*period*/)
{
return controller_interface::return_type::OK;
}

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn
TestControllerFailedActivate::on_configure(const rclcpp_lifecycle::State & /*previous_state&*/)
{
return rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn::SUCCESS;
}

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn
TestControllerFailedActivate::on_activate(const rclcpp_lifecycle::State & /*previous_state&*/)
{
// Simply simulate a controller that can not be activated
return rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn::FAILURE;
}

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn
TestControllerFailedActivate::on_cleanup(const rclcpp_lifecycle::State & /*previous_state*/)
{
return rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn::SUCCESS;
}

} // namespace test_controller_failed_activate

#include "pluginlib/class_list_macros.hpp"

PLUGINLIB_EXPORT_CLASS(
test_controller_failed_activate::TestControllerFailedActivate,
controller_interface::ControllerInterface)
Original file line number Diff line number Diff line change
@@ -0,0 +1,67 @@
// Copyright 2020 Department of Engineering Cybernetics, NTNU
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

#ifndef TEST_CONTROLLER_FAILED_ACTIVATE__TEST_CONTROLLER_FAILED_ACTIVATE_HPP_
#define TEST_CONTROLLER_FAILED_ACTIVATE__TEST_CONTROLLER_FAILED_ACTIVATE_HPP_

#include <memory>
#include <string>

#include "controller_manager/controller_manager.hpp"

namespace test_controller_failed_activate
{
// Corresponds to the name listed within the pluginglib xml
constexpr char TEST_CONTROLLER_WITH_INTERFACES_CLASS_NAME[] =
"controller_manager/test_controller_failed_activate";
// Corresponds to the command interface to claim
constexpr char TEST_CONTROLLER_COMMAND_INTERFACE[] = "joint2/velocity";
class TestControllerFailedActivate : public controller_interface::ControllerInterface
{
public:
TestControllerFailedActivate();

virtual ~TestControllerFailedActivate() = default;

controller_interface::InterfaceConfiguration command_interface_configuration() const override
{
return controller_interface::InterfaceConfiguration{
controller_interface::interface_configuration_type::INDIVIDUAL,
{TEST_CONTROLLER_COMMAND_INTERFACE}};
}

controller_interface::InterfaceConfiguration state_interface_configuration() const override
{
return controller_interface::InterfaceConfiguration{
controller_interface::interface_configuration_type::NONE};
}

controller_interface::return_type update(
const rclcpp::Time & time, const rclcpp::Duration & period) override;

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn on_init() override;

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn on_configure(
const rclcpp_lifecycle::State & previous_state) override;

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn on_activate(
const rclcpp_lifecycle::State & previous_state) override;

rclcpp_lifecycle::node_interfaces::LifecycleNodeInterface::CallbackReturn on_cleanup(
const rclcpp_lifecycle::State & previous_state) override;
};

} // namespace test_controller_failed_activate

#endif // TEST_CONTROLLER_FAILED_ACTIVATE__TEST_CONTROLLER_FAILED_ACTIVATE_HPP_
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
<library path="test_controller_failed_activate">

<class name="controller_manager/test_controller_failed_activate" type="test_controller_failed_activate::TestControllerFailedActivate" base_class_type="controller_interface::ControllerInterface">
<description>
Controller used for testing
</description>
</class>

</library>
83 changes: 83 additions & 0 deletions controller_manager/test/test_release_interfaces.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,7 @@
#include "controller_manager/controller_manager.hpp"
#include "controller_manager_test_common.hpp"
#include "lifecycle_msgs/msg/state.hpp"
#include "test_controller_failed_activate/test_controller_failed_activate.hpp"
#include "test_controller_with_interfaces/test_controller_with_interfaces.hpp"

using ::testing::_;
Expand All @@ -29,6 +30,19 @@ class TestReleaseInterfaces : public ControllerManagerFixture<controller_manager
{
};

class TestReleaseExclusiveInterfaces
: public ControllerManagerFixture<controller_manager::ControllerManager>
{
public:
TestReleaseExclusiveInterfaces()
: ControllerManagerFixture<controller_manager::ControllerManager>(
std::string(ros2_control_test_assets::urdf_head) +
std::string(ros2_control_test_assets::hardware_resources_with_exclusive_interface) +
std::string(ros2_control_test_assets::urdf_tail))
{
}
};

TEST_F(TestReleaseInterfaces, switch_controllers_same_interface)
{
std::string controller_type =
Expand Down Expand Up @@ -197,3 +211,72 @@ TEST_F(TestReleaseInterfaces, switch_controllers_same_interface)
abstract_test_controller2.c->get_lifecycle_state().id());
}
}

TEST_F(TestReleaseExclusiveInterfaces, test_exclusive_interface_switching_failure)
{
std::string controller_type =
test_controller_failed_activate::TEST_CONTROLLER_WITH_INTERFACES_CLASS_NAME;

// Load two controllers of different names
std::string controller_name1 = "test_controller1";
std::string controller_name2 = "test_controller2";
ASSERT_NO_THROW(cm_->load_controller(controller_name1, controller_type));
ASSERT_NO_THROW(cm_->load_controller(
controller_name2, test_controller_with_interfaces::TEST_CONTROLLER_WITH_INTERFACES_CLASS_NAME));
ASSERT_EQ(2u, cm_->get_loaded_controllers().size());
controller_manager::ControllerSpec abstract_test_controller1 = cm_->get_loaded_controllers()[0];
controller_manager::ControllerSpec abstract_test_controller2 = cm_->get_loaded_controllers()[1];

// Configure controllers
ASSERT_EQ(controller_interface::return_type::OK, cm_->configure_controller(controller_name1));
ASSERT_EQ(controller_interface::return_type::OK, cm_->configure_controller(controller_name2));

ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_INACTIVE,
abstract_test_controller1.c->get_lifecycle_state().id());
ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_INACTIVE,
abstract_test_controller2.c->get_lifecycle_state().id());

{
// Test starting the first controller
// test_controller1 activation always fails
RCLCPP_INFO(cm_->get_logger(), "Starting controller #1");
std::vector<std::string> start_controllers = {controller_name1};
std::vector<std::string> stop_controllers = {};
auto switch_future = std::async(
std::launch::async, &controller_manager::ControllerManager::switch_controller, cm_,
start_controllers, stop_controllers, STRICT, true, rclcpp::Duration(0, 0));
ASSERT_EQ(std::future_status::timeout, switch_future.wait_for(std::chrono::milliseconds(100)))
<< "switch_controller should be blocking until next update cycle";
ControllerManagerRunner cm_runner(this);
EXPECT_EQ(controller_interface::return_type::ERROR, switch_future.get());
ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_INACTIVE,
abstract_test_controller1.c->get_lifecycle_state().id());
ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_INACTIVE,
abstract_test_controller2.c->get_lifecycle_state().id());
}

{
// Test starting the second controller, interfaces should have been released
// test_controller2 always successfully activates
RCLCPP_INFO(cm_->get_logger(), "Starting controller #2");
std::vector<std::string> start_controllers = {controller_name2};
std::vector<std::string> stop_controllers = {};
auto switch_future = std::async(
std::launch::async, &controller_manager::ControllerManager::switch_controller, cm_,
start_controllers, stop_controllers, STRICT, true, rclcpp::Duration(0, 0));
ASSERT_EQ(std::future_status::timeout, switch_future.wait_for(std::chrono::milliseconds(100)))
<< "switch_controller should be blocking until next update cycle";
ControllerManagerRunner cm_runner(this);
EXPECT_EQ(controller_interface::return_type::OK, switch_future.get());
ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_INACTIVE,
abstract_test_controller1.c->get_lifecycle_state().id());
ASSERT_EQ(
lifecycle_msgs::msg::State::PRIMARY_STATE_ACTIVE,
abstract_test_controller2.c->get_lifecycle_state().id());
}
}
10 changes: 6 additions & 4 deletions hardware_interface_testing/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -21,12 +21,14 @@ foreach(Dependency IN ITEMS ${THIS_PACKAGE_INCLUDE_DEPENDS})
endforeach()

add_library(test_components SHARED
test/test_components/test_actuator.cpp
test/test_components/test_sensor.cpp
test/test_components/test_system.cpp)
test/test_components/test_actuator.cpp
test/test_components/test_sensor.cpp
test/test_components/test_system.cpp
test/test_components/test_actuator_exclusive_interfaces.cpp
)
ament_target_dependencies(test_components hardware_interface pluginlib ros2_control_test_assets)
install(TARGETS test_components
DESTINATION lib
DESTINATION lib
)
pluginlib_export_plugin_description_file(
hardware_interface test/test_components/test_components.xml)
Expand Down
Loading
Loading